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The idea for this book was conceived a few years ago, at a meeting on the 
occasion of Monik Charette’s retirement from the School of Oriental and 
African Studies. Not only has Monik been a steadfast member of London’s 
phonological community over many decades, she was also a driving force in 
making it a community, to the extent that most who have pursued graduate 
studies or research in phonology at SOAS and UCL, or indeed other nearby 
establishments, might well have had the impression she might have been a 
member of any of the institutions at which they had just met her. 

Those who know Monik would undoubtedly agree that she is valued 
not only as a highly influential phonologist but also as an extremely charming 
and charitable person. Indeed, over time she has brought many students 
and scholars from all over the world to the field of phonology, particu- 
larly to research on phonological empty categories and licensing. Monik’s 
work has had a profound impact both on Government Phonology itself and 
those conducting research in the framework and its many descendants, as 
exemplified by many contributions in the present book. 

We (and we include all contributors to the book in this ‘we’) are indebted 
to Monik for helping us forge connections, both within the wider phonological 
community and between the myriad respective intellectual interests and ideas 
harboured by of each of us. The work presented here is dedicated to her: 


Momo, as our dear friend, mentor, and colleague, we cannot thank you 
enough — but let us at least express our profound gratitude by means of 
the phonology contained herein. 
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Principles and parameters 
in phonology: an introduction 
and overview 


Connor Youngberg, Yuko Yoshida, 
and Florian Breit 


This book is the culmination of state-of-the-art research in phonology from 
2019 to 2022, with an eye on both theoretical proposals and careful investiga- 
tion of individual languages. ! 

The work presented broadly covers segmental structure, prosodic 
structure, phonotactics and vowel-zero alternations, including traditional 
descriptive and analytical work, laboratory and experimental results, and 
generative and representational phonology. Much of this work is inspired 
by topics and representational issues discussed in Charette (1991) and other 
seminal work in the canon of Government Phonology (Kaye et al. (KLV) 
1990), Element Theory (Kaye et al. 1985), Strict CV (Lowenstamm 1996) 
and Dependency Phonology (Anderson and Jones 1974; Anderson and 
Ewen 1987), as well as Optimality Theory (Prince and Smolensky 2004). 
Naturally, given the breadth of this work we cannot give an exhaustive 
overview of all the theoretical approaches and frameworks involved here. 
For this we refer the reader to Dresher and van der Hulst (2022) as a general 
broad overview, as well as chapters from Ritter (2022) and Staun (2022) 
for those readers unfamiliar with Government Phonology and Dependency 
Phonology. Not all of the work presented here is theoretical, however, 
and some contributions provide new work oriented towards fresh data and 
experimental results. 

The contributions provide theoretical advances, new empirical evidence 
and analyses challenging previous generalisations, and we believe that the 
insights offered here will be equally exciting for phonologists working on 
related issues inside and outside frameworks with a focus on representa- 
tional phonology, for example, researchers working in Optimality Theory or 
classical rule-based phonology. 
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The principles and parameters approach: Government 
Phonology 


Government Phonology (GP) is a theory that can be broadly construed 
as a framework which explores the unification of forces from Principles 
and Parameters theory (Chomsky 1981; Chomsky and Lasnik 1993) with 
phonological structure, representation, and generation. The framework has 
its origins in early work on syllabicity from Kaye and Lowenstamm (1984) 
and Lowenstamm and Kaye (1986). These concepts were developed further 
in the programme of Government Phonology through the 1980s and 1990s, 
with early seminal works including Kaye et al. (1985, 1990), Charette (1991), 
and Harris (1994). More recent work continues to further the development 
of GP, such as the proposal that all syllables are composed exclusively of 
consonant-vowel pairs in CV phonology (Lowenstamm 1996; Scheer 2004) 
and the incorporation of structure in addition to elements or features into 
subsegmental representations (Jensen 1994; Rennison and Neubarth 2003; 
Poéchtrager 2006). 

The main premise of this programme is the idea that phonology and 
syntax may not be as radically different mentally as they were often proposed 
to be in contemporary work (cf. Halle and Bromberger 1989). Instead, 
Government Phonology proposes that many of the fundamental mechanisms 
and principles may be shared across the two, such as the presence of empty 
categories, government and licensing relations, and the regulation of the 
surface-complexity of constituent and segmental structure through parameter- 
isation and principles operating in conjunction with government and licensing 
relations. A further notable differentiating factor is the central role given to 
mental representations (see, for example, Harris 2007). 

The nature of segmental melody was similarly re-evaluated, here leading 
to a more radical departure from the notion that phonology is really about 
deriving ‘phonetic’ representations from more abstract ‘phonological’ repre- 
sentations (for an overview and current work on debates surrounding the nature 
of phonological primitives, see also Breit et al. 2023), with no significant level 
of phonetic substance being acknowledged at any level of phonological repre- 
sentation (Harris and Lindsey 1995). Instead of the common equipollent articu- 
latory features popularised with The Sound Pattern of English (Chomsky and 
Halle 1968), melody is represented by combination of more abstract, privative 
features known as elements, similar but not identical to the privative features 
found in Dependency Phonology (Anderson and Jones 1974; Anderson and 
Ewen 1987) and Particle Phonology (Schane 1984). Element Theory can now 
be considered its own representational framework which is used both within 
and outside of Government Phonology (Backley 2011). 
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A broad overview of frameworks in this tradition can be found in Scheer 
and Cyran (2018a,b), Scheer and Kula (2018), and Ritter (2022). 


‘Standard GP’: a brief overview 


As already mentioned briefly, major research themes in Government Phonology 
(GP) include the proposal that government and licensing relations regulate 
both syntax and phonology, the recognition of empty categories/constituents in 
phonology, and the parameterisation and restriction of structural representation 
and generation both at the syllable and segmental level. Standard versions of 
GP recognise the constituent categories onset, nucleus, and rhyme (where the 
rhyme is, strictly, a projection of the nucleus) — codas are notably not recognised 
as constituents in their own right. All of these are limited to binary branching, 
and segments are associated to a skeletal tier below these constituents. What is 
often most striking to non-GP phonologists are the syllable structures evidenced 
in this theory, such as those given below for the Southern British English 
words splay and sent. These include multiple empty constituents including a 
word-initial ‘coda’ in splay and a final onset preceding an empty nucleus rather 
than a complex coda in sent. Empty constituents are not distributed freely. 
They are highly restricted by conditions on government (on which more is 
found in the short introductions to the various parts of this book, for example, 
Chapters 7 and 14). Governing and Licensing relations regulate interconstituent 
and intersegmental relations, as well as the highly restricted — and potentially 
silent — empty nuclei. These relations are not shown in the below diagrams, 
but relevant concepts are discussed in the later chapters of this book and in 
the aforementioned references. Empty constituents and positions silenced 
through government or parameterisation are conventionally shown underlined 
in representations, as in the examples below and throughout this book. 


(1) Example GP structure for splay 
O R O 


| 
N N 
x 


——_ X 


i) 
T —xX 
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(2) Example GP structure for sent 
O R O R 


N 
| 
E€ 


What is apparent is that while the initial spl of splay may be a complex 
onset or an appendix plus onset structure in other frameworks, word initial 
s+C sequences and others with falling sonority are treated as an empty 
syllable with only a rhymal complement or ‘coda’ (Kaye 1992; Goad 2012). 
Word-final clusters with a final falling sonority cluster such as the nt of sent 
are treated as a rhymal complement (‘coda’) followed by an onset (Kaye et al. 
1990; Kaye 1990). This is supported by a wealth of evidence which implies 
that word-final consonants are in fact onsets of a syllable containing an empty 
nucleus (see, for example, Harris and Gussmann 1998, 2002). 

Some Standard GP analyses eschew the use of branching constituents 
and treat most or all consonant clusters as onsets surrounding an empty 
nucleus. Examples include work on Polish (Gussmann and Kaye 1993; Cyran 
2010), Japanese (Yoshida 1995), and Bemba (Kula 2002, 2008). Such a state 
of affairs might be viewed as the consequence of a parameter regulating 
constituent-branching traditionally, but has latterly given rise to a rich body 
of work within the Strict CV or CVCV approach (Lowenstamm 1996, 1999; 
Scheer 2004, 2012), which rejects constituent-branching and conflates the 
constituent and skeletal tiers. Under this view, all words are composed of 
strictly repeating Consonant-Vowel (CV) pairs and all consonant clusters 
enclose an empty V position, as shown in (3) for the English word trap. 


(3) CVCV representation of trap 
CVCVC WV 


t ræ p 


There are also several studies exploring the use of GP or CV structures combined 
with Optimality Theory (OT) (Polgárdi 1998, 2015; Rowicka 1999; Cavirani 
2015), while others employ a VC skeleton or a ‘loose’ rather than a ‘strict? CV 
skeleton (Dienes and Szigetvári 1999; Polgárdi 2008). 

Segmental representations in GP utilise elements, which are a set of 
privative features encoding relatively broad phonological characteristics or 
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acoustic categories (Kaye et al. 1985; Harris and Lindsey 1995; Charette and 
Göksel 1996; Backley 2011), though Element Theory can well be considered 
a separate framework of melodic representation which is now often used in 
work not subscribing to GP’s approach to prosodic structure. Most versions of 
Element Theory employ six elements: |A|, |I|, |U], |H], |L|, and |?|, which are used 
in both consonant and vowel representations. In vowels, |A|, |I|, and |U| broadly 
correlate with low, front, and round vowels, respectively, while |H], |L|, and 
|2| correlate to devoicing/breathiness, nasality, and creakiness. In consonants, 
|A], |I|, and |U| broadly correlate to the place categories coronal/pharyngeal, 
palatal/coronal, and velar/labial, while |H], |L|, and |?| broadly correlate to 
aspiration/frication, nasality/voicing, and occlusion/obstruency. What all of 
the elements have in common is that they may occur and be interpreted both 
independently (as a sole property of some vowel or consonant) as well as in 
combination with other elements to form more complex segmental representa- 
tions. For more details on Element Theory see Chapter 2. 


Recent developments and current questions 


In recent years, increased attention has been given to topics such as the 
accuracy of GP’s syllabification approach, how to deal with accent and stress 
in a framework which does not formally recognise feet and other metrical 
constituents, and how to best capture the varying character of segments in 
terms of their phonological behaviour and phonetic realisation. There are 
central debates as to whether syllabic and segmental structure can be further 
reduced or even unified, and there has been increasing recognition of the fact 
that while GP has been very successful in modelling representational aspects 
of phonology, the specifics of phonological computation and phonological 
processes as well as the analysis of tone and lexical stress currently leave 
much to be desired. The framework makes strong predictions and places 
heavy restrictions upon what may be expected to exist or how certain data 
ought to be understood, and more recent examinations of data new and old 
heavily challenge some of our theoretical assumptions. 

A large body of recent work examines how CV templates and segmental 
structure may be utilised at the morphosyntax—phonology interface, seeking 
to better understand what a morpheme is and how phonological representation 
can feed into that understanding. This includes but is not limited to work from 
Lowenstamm (2008), Bendjaballah and Haiden (2008), Bendjaballah (2012), 
Scheer (2012), Lampitelli (2014), Fathi and Lowenstamm (2016), Barillot 
et al. (2018), and Breit (2019). 

Another major avenue of current investigation concerns the question 
of whether syllabic and segmental structure can (and should) be further 
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reduced and unified. This typically means reducing the number of elements 
representing melody and accounting for the properties thus not independently 
encoded through (potentially more restrictive) structural means instead. The 
principal ongoing development in this direction is GP 2.0 (Živanović and 
Péchtrager 2010; Péchtrager 2020), which proposes that the set of elements be 
further reduced while more complex, embedded representations account for, 
for example, the effects of lowness instead, thereby linking them explicitly 
with length phenomena. Other proposals examining segmental structure in a 
similar vein include Precedence-Free Phonology (Nasukawa 2014; see also 
Chapter 22) and Onset Prominence (Schwartz 2012), which posit a richer and 
more fine-grained structural resolution of melodic representation, as well as 
Radical CV (van der Hulst 2020; see also Chapter 23), which is more closely 
tied to Dependency Phonology and reduces melodic representation to just two 
elements (C and V), encoding various complex melodic properties through 
the same rich, embedded structure and dependency relations that also obtain at 
segmental and syllabic levels. 


Outline of the present work 


Our book is divided into four Parts, reflecting major active research themes 
both in the GP programme and research on phonology as a broader field. 
Each begins with a brief introduction outlining the GP view of the broad 
themes treated and introducing the chapters contained therein. By adopting 
this ‘distributed introduction’ approach we hope to make the work presented 
here more accessible to readers who are less familiar with the framework. 
Part 1 focuses on melodic and segmental representations. Part 2 examines the 
role of prosodic and constituent structure in relation to processes and distribu- 
tions. Part 3 is devoted to the role of vowel-zero alternations, syncope, and 
epenthesis in various languages, and how the role of schwa can be understood. 
Finally, Part 4 considers the role of recursive structures in phonology. 

While each chapter addresses key issues and questions relevant specifi- 
cally to the framework and research programme we have outlined above, not 
all of these are framed within Government Phonology terms. Some take a 
theory neutral approach, focusing on empirical and experimental data which 
challenges commonly held phonological assumptions and generalisations 
taken to be true by many theoreticians, such as what a cluster in English is 
or what generalisations we can make about svarabakhti in Irish. Many of 
the chapters represent exciting new endeavours to develop and push theory 
further, including new work on the interaction of GP and OT computation 
and proposals for further suprasegmental parameterisation seeking to better 
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explain prosodic phenomena. We believe that the work presented here offers 
interesting ideas, findings, and data, not only for those fluent with the GP 
literature but also those who are new to the framework or simply curious 
about current issues, developments, and thought from various quarters of 
phonology. We hope that both the theoretical developments and the careful 
studies of French, Bavarian, Irish, Cypriot Greek, Japanese, and other 
language varieties contained herein will encourage further debate, discussion, 
and cross-theoretical interaction in works yet to come. 


Note 


1. This book and also this chapter are an equal-contribution editorship/authorship; there is no significance 
to the author order for either. 
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Melody and segmental 
representation: a brief introduction 


Florian Breit 


Introduction 


Element Theory (ET) has undergone many revisions since its first inception 
with Dependency Phonology (Anderson and Jones 1974; Anderson and Ewen 
1987) and Particle Phonology (Schane 1975, 1984) and subsequent adoption 
in early Government Phonology (Kaye et al. (KLV) 1985). For instance, the 
set of elements assumed has undergone much change. While mainstream ET 
has for some time now settled on a set of six features (|H, L, ?, A, I, UJ), early 
versions included, among others, the neutral element |@|, a noise element 
|b|, and a nasal element |N|. Yet in some current versions there is a drive to 
further reduce this set, typically by adopting structural conventions to encode 
relevant distinctions, such as the disposal of |A, H, ?| in GP 2.0 (for example, 
cf. Péchtrager and Kaye 2013). Variation also concerns other matters such as 
the role of headedness and dependency, the role of substance (see also van °t 
Veer et al. 2023), and representational organisation. For a more in-depth 
overview of variation within ET see Backley (2012). 


Consonant-vowel unity 


Although some elements, especially |?|, have not been linked to a consistent 
role within nuclei, ET assumes consonant-vowel unity (cf. Backley and 
Nasukawa 2010). That is, segmental melody is represented by the same 
set of elements regardless of the segment’s dominating category (that is, 
onset or nucleus). This carries the theoretically interesting implication that, 
although we might be able to link specific elements to typical phonetic 
exponents dependent on such categories, ET proper makes no use (or, has 
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no knowledge) of a distinction between such entities as vowels and 
consonants. 


Privativity 


Many theories of phonological primitives have a notion of multivalency. For 
example the feature theory of Chomsky and Halle’s (1968) SPE employs 
equipollent binary features such as [+high] and [—high], while other feature 
theories may have unary but explicitly matched (and thus in a sense 
equipollent) pairs of features, such as [compact] and [diffuse] in Jakobson 
et al. (1952). In contrast, ET assumes that all melodic primitives (viz. the 
elements) are strictly monovalent and privative. This means that in ET, the 
primitives have no ‘negative’ or ‘unmarked’ counterparts at any theoretically 
relevant level. Only the presence of an element within a melodic representa- 
tion is recognised, and the absence of an element is neither referenceable nor 
interpretable. 

Binary distinctive features have been a mainstay of work conducted in 
the tradition of linear phonology, such as Jakobson et al. (1952), Jakobson 
and Halle (1956), and especially Chomsky and Halle (1968). Such features 
have traditionally been employed not only to encode those categorial distinc- 
tions pertaining to the phonetic identity of phonological segments, but also 
to encode aspects such as stress, syllabicity, boundary information, and so 
on. As such, no explicit distinction was made between prosodic and melodic 
(that is, segmental) information. Later work however showed the need to 
formally distinguish these two levels of phonological representation, leading 
to the development of nonlinear models of phonology such as Autosegmental 
Phonology (Goldsmith 1972), which non-monotonically link melodic and 
prosodic information on different tiers of phonological representation, and 
this is the view taken by both Government Phonology and ET (cf. Harris 
2007). 


Interpretability 


Another important property distinguishing ET from most other theories of 
melodic representation concerns the interpretive requirements of melodic 
material. Whereas it is standardly assumed that the melodic output of 
phonology is interpretable only as a fully specified matrix of features in, for 
example, SPE-type theories, elements are understood as inherently interpret- 
able units, fully sufficient to receive phonetic interpretation in their own 


ELEMENTS, GOVERNMENT, AND LICENSING 


right (Lindsey and Harris 1990; Harris and Lindsey 1995). This means that 
any element, say |I|, is interpretable both as the sole melodic specification 
of a segment as well as in combination with other elements (for example, 
in the compound expressions |I, A| and |I, H, 2). An important consequence 
of this view is that an element-theoretic representation is interpretable at all 
times — there is no sense in ET in which for instance the underlying represen- 
tation of some form can be said to lack the necessary melodic specification 
to be phonetically interpretable, as would be the case with most SPE-type 
underlying representations that are heavily underspecified lexically and 
rely on phonological processing and feature-filling rules to reach a state of 
interpretability. 


The set of elements 


Current mainstream ET assumes a set of six primitives: |A, I, U| and |H, 
L, ?| (cf. Backley 2011, 2012; Kaye 2000). The former three are commonly 
known as resonance elements (also place or colour) and principally correlate 
with vocalic quality and place of articulation. The latter are often termed 
the manner elements because they principally correlate with manner of 
articulation (though that term is too strict to cover the range of their phonetic 
correlates). A brief summary of the six elements, their mnemonic names and 
principal characteristics is given in (1). 


(1) Element Name Characteristics 


Al mAss Lowness, uvular and pharyngeal place 
I| dip Frontness, palatal and coronal place 
U| rUmp Roundness, velar place 

H| high High tone, aspiration, frication 

L| low Low tone, voicing, nasality 

?| edge Noncontinuancy (‘stopness’) 


Segmental composition 


As mentioned, the elements may appear in melodic representations by 
themselves as well as in combination with one another. However, in addition 
to this, ET also recognises an empty representation (a necessary consequence 
of adopting the Empty Category Principle in GP) and recognises that 
elements in compound representations may enter into a head-dependent (or 
head-operator) relationship. 
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Empty representations receive a language-dependent default inter- 
pretation when not licensed to be silent (that is, to remain uninterpreted) by 
Proper Government. For nuclei this is typically a neutral central vowel such 
as /9/ or /i/. As is often the case, the situation has been less well investigated 
with consonants to date, but, for example, in German empty initial onsets are 
interpreted as a glottal stop. 

The head-dependent relationship that elements may enter into is much 
more restricted in ET than, for example, in classical Dependency Phonology. 
ET standardly allows only a single element within a melodic representation 
to assume head status (though Backley 2017 argues for multiple headedness), 
while the others remain as dependents. At the same time, melodic representa- 
tions do not have to be headed in ET, so that |I|, |I], |L, AJ, |L Al, and |I, A] (heads 
underlined) are all distinct melodic representations. 

Closely connected to this highly restrictive conception of a dependency 
relation (vis-a-vis segment-internal structural asymmetry) is that ET assumes 
that melodic representations permit isomerism (though always achiral) and 
operations thereon are idempotent, in the sense that (unlike, for example, in 
Schane’s Particle Phonology) an element is either present or absent from a 
representation without any notion of multiplicity and that two melodic repre- 
sentations are distinct if, and only if, either they are composed of different 
elements or the head of the representation is different (Breit 2013, 4-5). 
Consequently, compounding an element |I| into a representation such as |A, I| 
results in identical output, that is |A, I|, and while |A, Ij, |A, I|, and |A, I| are all 
distinct by virtue of the different head, there is no difference between notations 
such as |A, I| and |I, A| since they contain the same elements and have the same 
head (that is, order is irrelevant). 


Themes addressed in Part 1 


The chapters in Part | address a variety of current research themes in ET and 
its application. Bendjaballah (Chapter 3) contributes to our understanding of 
the internal make-up and behaviour of liquids, which have long presented 
puzzles for segmental phonologists, regardless of the assumed framework. 
In ET, it is often assumed that such liquids feature a combination of either 
|A, I| or |A, U| while |U] and |I| themselves function antagonistically to one 
another (see, for example, Backley 2011). Conversely, Bendjaballah demon- 
strates through an analysis of Bavarian /-localisation that the process can 
simultaneously show affinities suggesting that both |U] and |I| play an active 
role, with surface effects dependent on sequential licensing issues rather than 
segment-internal antagonistic relations. In doing so, she is able to present a 
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unified analysis of post-vocalic and post-consonantal environments in which 
/\/ lenites in these varieties and to make strong arguments suggesting that the 
behaviour seen is indeed in the grasp of phonological explanation, rather than 
having to be relegated to lexical encoding and diachrony. 

Turning his attention to final-obstruent devoicing in Polish, Cyran 
(Chapter 5) explores the separation of explanation across linguistic levels 
for certain marginal phenomena such as voicing-retention in the context 
of arguments for a more substance-free/relativistic approach to laryngeal 
categories (see, for example, Cyran 2014). Cyran argues that there are good 
reasons to believe that a separation of concerns is not only feasible but 
also desirable, showing that apparently conflicting phenomenology may be 
reconciled if we correctly consider their origin and distribution across the 
interface between segmental phonology and language-specific phonetics. 

Breit (Chapter 4) is more concerned with the theoretic characterisa- 
tion of ET itself, revisiting an earlier formalism (Breit 2013) and arguing 
explicitly for a derivational characterisation that draws out possible cognitive 
homology across linguistic modules. Extending previous work, he also 
seeks to formally integrate and characterise Licensing Constraints (LCs; 
for example, Charette and Göksel 1996), which opens up the possibility for 
further investigation of their role in the generation and distribution of various 
segmental systems. 

The issue of LCs and the encoding of restrictions commonly attributed 
to them is also taken up by Rennison (Chapter 6), who argues that we ought 
to give consideration to different explanatory hypotheses in light of LCs’ 
potential arbitrariness. Instead of arbitrary LCs, Rennison argues that we 
should revisit geometric constraint mechanisms by showing that these are able 
to encode a more conservative and simultaneously revealing set of restric- 
tions in the context of Turkish Vowel Harmony, one of the original sources of 
evidence for the modern theory of LCs. 
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On hedgehogs and gold in Bavarian: 
/-vocalisation in Upper Austrian 
German 


Sabrina Bendjaballah 


Introduction 


Liquids are known for their peculiar behaviour crosslinguistically and the 
question of their internal structure has been much debated in the literature. !? 
In this context, /-vocalisation in Bavarian has been discussed in different 
frameworks (for example, Kranzmayer 1956; Rennison 1978; Wiesinger 
1990; Kühnhammer 2004; Djabbari et al. 2010; Kaye and Péchtrager 2015; 
Noelliste 2017). This phenomenon refers to the fact that post-vocalic / in 
Standard German does not correspond to post-vocalic / in various Bavarian 
languages, for example, Upper Austrian German [gord] ‘gold’ versus Standard 
German Gold. 

This phenomenon however does not exclusively obtain in post-vocalic 
context: it is also observed in post-consonantal context, for example, Upper 
Austrian German [y:g1] ‘hedgehog’ versus Standard German Igel. This 
fact has largely been ignored in the literature. My aim in this chapter is 
twofold. First, I propose an analysis of the phenomenon in post-consonantal 
context. More specifically, applying an Element Theory analysis (Kaye 
et al. (KLV) 1985, 1990; Charette 1991; Backley 2011) to data from a 
particular Bavarian language (Weyer Bavarian, Oberösterreich, Austria), I 
show that the processes in post-consonantal and in post-vocalic context can 
be unified. Second, I examine the implications of the analysis on the status 
of /-vocalisation as a synchronically active phonological process in Bavarian. 
The analysis will attempt to disentangle the intricacies of the behaviour of |I| 
and |U|, which have been beautifully handled in Turkic languages by Charette 
and Göksel (1996). 

The three sections that follow present the data, the analysis, and then 
evaluate the implications of the analysis and conclude the chapter, respectively. 
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Data 


Introduction 


l-lenition? regularly takes place in coda position and results in [w] crosslin- 
guistically. The process is reported both synchronically and diachronically. 
The probably most widely discussed languages are English dialects, for 
example, milk realised in London English as [mrwk]. However this type 
of /-lenition is attested in many genetically unrelated languages; see, for 
example, Omani Mehri (Semitic, Modern South Arabian): [jəlú:bəd] versus 
[jowbé:d] /jolbé:d/, Vibd ‘hit’ 1pr.3ms versus sBJ.3ms (Bendjaballah and Tifrit 
2018). Z > [j] is less common, but it is also attested in various languages; see, 
for example, Standard Taqbaylit Berber [ali] versus Ouadhia Taqbaylit Berber 
[aji] ‘climb’ (Bedar and Quellec 2020). 

l thus presents an affinity both with |I| and |U|. The question arises as to 
whether these elements combine in the internal structure of /, or whether / has 
either |U| (for example, Omani Mehri), or |I| (for example, Ouadhia Berber). 
Backley (2011, 178) supposes that |U| and |I| ‘do not easily combine’ and 
proposes that laterals are composed either of |A| and |U], or of |A] and |I|.4 

It has long been noticed that the surface realisations of coda / in 
Bavarian involve both |I| and |U], as seen respectively in /gold/ [gord] ‘gold’, 
and /elf/ [œ:f] ‘eleven’ (for recent analyses in an Element Theory framework, 
cf. Djabbari et al. 2010; Bendjaballah 2012). However, the existence of 
the phenomenon as a synchronically active phonological process has been 
challenged in Kaye and Pochtrager (2015), who argue that the surface realisa- 
tions reflect the underlying representations: there is no /I/ in examples such as 
[gord] ‘gold’ and [œ:f] ‘eleven’, and therefore no /-lenition. 


Vowels 


In the literature, the discussion of /-lenition in Bavarian focuses on the 
interaction of coda /I/ with its preceding nucleus. Two subsets must be distin- 
guished depending on the identity of this nucleus. The first column in (1) 
exemplifies the phenomenon after a back vowel. The second column gives 
a related form where /I/, being in an onset position, surfaces as such, thus 
establishing its presence at the underlying level in the examples of the first 
column: 


(1) coda /I/ onset /I/ 
/w/ [fur] ‘school? [fulebue] ‘schoolboy’ 
/o/ [kom] ‘roll’.INF_ [so:led] ‘roll’ .cOND 
/3/ [fom] ‘fall’.ınF [fo:led] ‘fall’.COND 
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(2), below, exemplifies the phenomenon after a front vowel. As in (1), the 
second column provides related forms where /I/ surfaces. The underlying 
identity of the vowels in the forms in the first column of (2) is a matter of 
debate: Is it identical to the surface (y, œ, ø), or does it have the identity 
that can be seen in the corresponding Standard German forms (i, e, €)? The 
stem-vowel is identical in the forms with and without [l]. This casts doubt 
on the assumption of /i, e, e/. However, additional data support an analysis 
in terms of rounding triggered by /l/. For this reason, I assume rounding 
of /i, e, ¢/ to [y, ce, ø] for the time being, and will come back to this issue 
below. 


(2) coda /I/ onset /I/ Standard German 
fy] 4/ [gby:n] ‘play’.nr [gby:led] ‘play’.COND spielen 
[ce] /e/ [mæ:] ‘flour’ [meoe:li(g)] ‘tasting like flour’ Mehl 


{o]/e/ [fon] ‘peel’.nF  [fo:led] “peel’.COND schdlen 


The following distributional generalisation can be established: 


(3) Ifthe vowel preceding coda /I/ contains 
a. |U], but not |I| /1/ surfaces as [1] 
b. |I|, but not |U| // does not surface, and the preceding vowel is 
rounded.’ 


Diphthongs 


l-lenition after diphthongs is less often discussed, if at all. As illustrated in 
(4a), the off-glide preceding /I/ is neither [1] nor [v], but [q]. (4b) illustrates the 
fact that [ay] never surfaces if the consonant following the off-glide is not /I/. 
Following Kaye and Lowenstamm (1984), and many others, I assume that the 
internal structure of glides consists of |U| [w], |I| [j] and a combination thereof 
|U.I| [q]. An off-glide preceding /I/ is always realised as |U.]| [q]. 


(4) infinitive conditional Standard German 
a. defayn defayled ‘rot’ verfaulen 
dayn dayled ‘divide’ teilen 
fayn fayled ‘file, grate’ feilen 
b. warm Karbed ‘rub’ reiben 
savffa savffed ‘booze’ saufen 
savn savged ‘suck’ saugen 
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Consonants 


The question of the interaction of /l/ with a preceding consonant is 
remarkably absent, or at best marginal, in the literature on /-vocalisation in 
Bavarian.° 

The realisation of /l/ after a consonant depends on the identity of the 
consonant, exactly as was the case post-vocalically. The distributional facts 
are summarised in (5) and illustrated in (6) and (7). As shown in the second 
column of (6), if a vowel-initial suffix is added, /l/ surfaces as such. I take this 
to establish its existence in the underlying representation of the lexical items 
under consideration. 


(5) Ifthe consonant preceding /1/ is 
a. alabialoravelar(#y) — /I/ surfaces as [1] 
b. adental-alveolar (4 n, d) /I/ surfaces as [I] 


(6) infinitive conditional 
a. labial _ [auf dy:Bın] [auf dyßled] ‘fix, make stick’ 
[o: dayfin] [o: dayfled] ‘throw a fit’ 
[a6 hymmin] [4 hymled] ‘praise to the skies 


2 


b. velar _ [ku:gin] [ku:gled] ‘roll’ 
noun adjective 
[bu: gi] [bu: gled] ‘back’, ‘bumpy’ 
(7) dental-alveolar___ [fuxdln] ‘“gesticulate’.INF 
[muff] ‘mussel’ 
[fnits]] ‘escalope’ 


/n, d, y/ must be excluded in (5) because they trigger additional phenomena: 
n+l is realised as [ndl]; the acoustic realisation of {y, d}+l requires a 
detailed phonetic analysis. In this chapter, I will transcribe the resulting 
segment as L and leave the question of its exact realisation for further 
research.’ 

Finally, the realisation of /I/ after palatals cannot be tested, since Upper 
Austrian German does not have underlying palatal obstruents: [¢] is the 
realisation of an underlying velar. /l/ preceded by [ç] behaves as expected: it 
surfaces as [1], for example, [syçç1] ‘sickle’. The only context after palatal is 
thus after an off-glide, cf. (4). 

The affinity between labials and velars has often been noted in 
Element Theory, and it has been ascribed to the presence of |U| in both 


ELEMENTS, GOVERNMENT, AND LICENSING 


classes of consonants. (See for example, Backley 2011, 81ff.: ‘this [...] 
supports the idea that velars and labials are phonologically related [...] this 
relation is based on the presence of a shared |U| element’.) Assuming in 
addition that the only consonant with |I| in Upper Austrian German is [1], 
we conclude that 


(8) Ifthe consonant preceding /1/ 
a. contains |U] but not |I| /\/ surfaces as [1] 
b. contains neither |U| nor |I| /1/ surfaces as [l] 


Note that (8) is not sensitive to the morphological status of /I/: it applies 
exactly in the same way in cases where /I/ has affixal status, as illustrated 
in (9). Diminutive -/ is realised as [1] after labials and velars, and as [1] after 
dental-alveolars. 


(9) base /-diminutive 
labial__ [ko:pf] [keppfi] ‘head’ 
velar _ [duex] [dyeççı] ‘cloth’ 
dental-alveolar_ [hbě:n] [hbě:ndl] ‘hen’ 


Action at a distance 


/l/ also interacts with a segment located at a distance. The facts are exemplified 
in (10). The forms in the first column are /I/-final nouns; the forms in the 
second column are diminutives, that are built by suffixation of -erl. As noted 
above, I transcribe the surface realisation of /I/ as [L] while the exact phonetic 
character remains to be determined. 


(10) stem- /-final erl-diminutive 
vowel noun 
a. y~i labial [dy:r] [di:BeL] ‘dowel, bump’ 
velar___—[y:gt] [i:geL] ‘hedgehog’ 
b. o~e labial [gZdombbr] [3dembbeL] ‘stamp’ 
velar__—s_ [dag] [de:geL] ‘container’ 
cë o labial — [kno:fi] [kno:feL] ‘garlic’ 
velar__ [fo:g1] [fo:geL] ‘bird’ 
u labial = [zdumm] [zdu:mmeL] ‘short cylinder’ 
velar__ [ku:g1] [ku:geL] ‘ball’ 


The internal structures of the vowels are summed up in (11): the vowel 
in the /-final noun systematically contains an additional |U]. (If the vowel 


ON HEDGEHOGS AND GOLD IN BAVARIAN 


23 


24 


contains |U], this addition is not reflected on the surface, and no alternation 
is observed.) 


(11) VC-I (noun) VC-eL (diminutive) 


y [U.I i Ir 
Ø, Œ |A.U.I| e, E€ |A.I| 
o |A.U| 
u IU] 


I propose to interpret these facts as follows: 


(12) a. gL-diminutives make it possible to recover the underlying quality 
of the stem-vowel. 
b. Final /l/ triggers rounding (+|U]) of the stem-vowel. 


In order to establish (12), we must first exclude the outside interference of an 
independent, morphologically conditioned, phenomenon: Umlaut. Umlaut in 
Upper Austrian German obeys (13): 


(13) [U| > II] 
As seen above, Upper Austrian German has two diminutive suffixes: -/ and 


-eL. -l is not productive and is used in a closed class of lexicalised items; -eL 
is more productive. Diminutive -/ always triggers Umlaut: 


(14) base /-diminutive 
a. /u/—i/_I,,, [vueft]  [vieftl] ‘sausage’ 
[havs] — [hars]] ‘house’ 
/o/ > e/_I,,, [ko:s] — [wessl] ‘horse’ 
b. i—i [kind]  [kind]] ‘child’ 
e—e {hé:n] [hě:ndl] ‘hen’ 


Now note in (10) that /-final nouns always build their diminutives via 
suffixation of -eL to the nominal base without final /1/. The fact that final /1/ 
is systematically absent in the diminutive suggests that it functions as a (class 
marker) suffix: / has affixal status in /-final nouns. The question thus arises 
as to whether it triggers Umlaut. Such is not the case in (15). Diminutive 
-l triggers Umlaut while class marker -/ does not. 
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(15) stem-vowel /-finalnoun Umlaut 
HUI, | 


oe 


Wwe | 


Diminutive -eL triggers Umlaut for some roots, only (16a versus 16b).° 


(16) base eL-diminutive 
a. /u/—i/_eL,,, [dumm]  [dsimmeL] ‘large lump’ 
/o/ —>e/_eL pu [koeb] [keeBeL] ‘basket’ 
b. u—u [hund] [hundeL] ‘dog’ 
o—o [bo:t] [bo:teL] ‘boat’ 
c. i—i [fi:¢] [fi:ceL] ‘animal’ 
e—e [ve:g] [ve:gaL] ‘path’ 


The question is whether [i], [e] in the diminutives in (10a,b) result from 
Umlaut. If yes, the underlying identity of the corresponding stem-vowels is 
/u/, /o/ respectively. This leads to the pattern in (17): the underlying vowel 
of a word like dy: fr ‘dowel’ is /u/, and final /l/ adds |I| to this vowel. [i] in 
the corresponding diminutive (di:feL) results from Umlaut. By contrast, 
in a word like ku: gr ‘ball’, final /1/ does not add |I|, and the corresponding 
diminutive does not undergo Umlaut. 


(17) _/final N ; eL-diminutive 
/u/ y (+I) :i (+Uml) l [dy:ßB1] [di:BeL] alternating 


lol ø ŒI) e (+Um) = [døg] [de:geL] 
h : u : [ku:gr] [ku:geL] non-alternating 
/o/ o [fo:g1] [fo:geL] 


We are faced with a choice between the following options: either we maintain 
a unified behaviour of final /I/ (/I/ always has the same effect on a given vowel 
quality), or we maintain a unified behaviour of roots (certain roots always alternate, 
others never alternate, and this feature has to be specified in the lexical entry of the 
root). I adopt the first option: the effect of /I/ is a regular, predictable, phonological 
process, and the diminutive makes it possible to recover the underlying identity of 
the stem-vowel. Under the second option, underlying /u, 0/ in words like dy: pr, 
dø: gt would never be recoverable from any form in the language. 
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I conclude that [i], [e] in the diminutives in (10a,b) do not result from 
Umlaut, but are the underlying stem-vowels. Diminutives make it possible to 
filter out the effect of final /1/ on the stem-vowel and reveal the underlying 
vowel quality. The non-diminutive forms reveal the effect of /1/ on this 
underlying vowel: rounding. !° 


Conclusions 


/\/ has both a local effect, and a distance effect: 


(18) Local effect: 
a. After a segment (C/V) that contains |U| but not |I|, /I/ surfaces as [1] 
b. After a segment (C/V) that contains |I| but not |U], /I/ does not 
surface, and triggers rounding. 


(19) Distance effect: 
In V(:)Cl#, if C contains |U], /l/ surfaces as [1], and triggers rounding 
of V. 


Analysis 


The representations to follow will be cast in the CV-framework of Government 
Phonology (Lowenstamm 1996). (20) provides a summary of the empirical 
generalisations the analysis has to account for. 


(20) Generalisations: 

a. /-lenition takes place > 
— /l/is not licensed by its nucleus. 
— the immediately preceding segment contains either |I|, or [U]. 

b. Ifthis segment contains |U] but not |I], /1/ surfaces as [1], and 
triggers rounding to its left, up to V, ina V,C/ sequence. 

c. Ifthis segment contains |I| but not |U], /I/ does not surface, and 
triggers rounding of the immediately preceding segment. 


(20a) pertains to the conditions under which /-lenition takes place. If /l/ is 
followed by its nucleus, it is licensed, no matter what the internal structure of 
the preceding segment is (21a). If /I/ is not licensed by a nucleus with vocalic 
content, lenition is triggered (in the spirit of Harris 1994).'! The presence of |I| 
and |U| in the preceding segment conditions the realisation of /1/. I take this to 
indicate that /I/ contains at least |I| and |U| (cf. Djabbari et al. 2010 for a similar 
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assumption), and that /-lenition involves dissimilation: if /l/ shares part of its 
internal structure with the preceding segment, /-lenition obtains. If /1/ does not 
share part of its internal structure with the preceding segment, it is realised 
as []] (21b). 


21) a. ; 
re Lic : 


C VCVCV VCV CVCVCVCV 


IIl II| 
IU} JU] Ul lU] lU] 
X |A| Y 
b u g l ad m u S l 
[buglad] ‘bumpy’ [muffl] ‘mussel’ 


(X and Y represent any other element in the internal structure of C.) 


Let us now consider (20b-c). /-lenition is a partial (20b) or total (20c) disas- 
sociation of the internal structure of /1/. Which part is disassociated depends on 
the identity of the segment immediately preceding /1/:!* 


(22) a. Ifthe preceding segment contains |I|, |I| is disassociated and 
disappears altogether. Otherwise, |I| remains associated. 
b. |U| is always disassociated and spreads to the left. 


The configurations after vowel and consonant are represented in (23) and (24) 
respectively. (23/24a) show that if the preceding segment contains |U], only |I| 
remains associated. I assume |U] to disassociate from C and spread to the left, 
even though |U|-spreading is not visible since the preceding segment contains 
|U]. (23/24b) illustrate the situation after a segment containing |I|, namely |I| 
is disassociated, and |U| spreads to the left. |U|-spreading is visible on the 


surface. 
(23) a ecvevevbecveveveyv 
(|I| disassociation) [I| [Il II| 
|U| disassociation = ' 
|U|-spreading |U] << JU] IU] << |U| 

J 3 b 

[fu:1] ‘school’ [3by:n] ‘play’ 
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(24) a C VC VC V b C VCVC V 


(|I| disassociation) \ I1 I1] 
|U] disassociation |X| = ' 
|U|-spreading [U| << |U] IU] IA] IÙ] << |U] 
b u g 1 d 
[bu:g1] ‘back’ [dayn] ‘divide’ 


Finally, the configurations where |U| spreads at a distance are represented in 
(25). Note that for this process to take place, the intervening segment must 
contain |U]. This supports an analysis in terms of spreading rather than harmony. 


(25) Cc VC VC Vv 
I ji 
|U] disassociation i = 
|U|-spreading |U] << |U| << JU] 
IXI 
d i b 1 


[dy:B1] ‘dowel’ 


(26) 


|U| disassociation 


|U|-spreading 


Cc V C Vv Cc Y 


VIS 


i i 


IU] << |U] << |U] 


|A| lY| 
d e g i 


[dø:g1] ‘container’ 
According to this analysis: 


e |I| must be disassociated if preceded by |I|. I take this to result from an 
OCP/dissimilation effect. (For a different instantiation of liquid dissimi- 
lation in Bavarian, see Hall 2009.) 

¢  |U| behaves uniformly: it must always disassociate, and spread as far as 
it can to the left. Note the peculiarity of |A], that resists rounding in the 
first part of the diphthong in (24b). 


Dissimilation does not apply between labials/velars and preceding 
|U|-vowels, as can be seen, for example, in [bu:g1] ‘back’. Neither does it 
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apply between j and preceding |I|-vowels, for example, [fijssn]!* ‘shoot’. 
Getting back to Backley’s observation that |I| and |U| do not easily combine, 
I propose that the dissimilation of / should be ascribed to this ‘difficulty in 
combining’: Upper Austrian German shows that in a language where |I| and |U| 
combine within a segment, this segment is unstable, and its internal structure 
is subject to disassociation. Note that in Upper Austrian German, /-lenition 
does not imply loss of elemental content altogether. Rather, the information on 
the internal structure of /I/ is recoverable from the context. 


Conclusion: a phonological process? 


We are now in a position to evaluate the status of /-lenition in Upper Austrian 
German: is it a synchronically active phonological process? 


The problem 


Recall that the absence of vowel-alternation in pairs like [meoe:] ‘flour’, 
[mæ:lig] ‘tasting like flour’ casts doubt on the assumption of underlying /I/ in 
this type of forms: do we need to posit underlying /e:1/ in [moe:], or is /œ:/ the 
underlying vowel? Kaye and Pochtrager (2015) argue in favour of this second 
option. They state that both rounding of the preceding vowel and ‘the case 
of intrusive |’ are not phonological: ‘schnell [fnœ:] ‘fast’ / schneller [fneele] 
‘faster’ are different lexical entries [...] and thus not derived from each other’. 
I do not question the fact that schnell and schneller are two lexical entries — all 
forms are lexical entries, even if they are productively derived from another 
form. However, this does not mean that we must get rid of phonology entirely, 
and that the underlying representations are identical with the surface realisa- 
tions of the lexical entries under consideration. 


The argument 


The behaviour of /l/ after a consonant reveals two salient facts that suggest 
that the issue should be revisited, and that a phonological analysis should 
not be discarded. First, the realisations of /I/ obey the same generalisations 
post-consonantally and post-vocalically. Second, /l/ triggers a stem-vowel 
alternation at a distance, whose pattern is identical with the one observed 
locally in (post-vocalic) coda position. The fact that the processes after 
vowel, after consonant, and at a distance obey the same regularity supports 
the analysis of /-lenition as a phonologically active phenomenon. A lexicalist 
view seems to miss this parallelism. 
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In addition, the process applies irrespectively of the morphological 
status of /I/: diminutive -/ and class marker -/ behave on a par with respect 
to the realisation of /l/ and rounding (crucially, not with respect to Umlaut). 
I take this fact to indicate that /-lenition is a phonological process. 

Finally, Kaye and Péchtrager (2015) note that ‘Crucially no kind 
of suffixation would help to recover the correct UR, acquisition is only 
possible if child knows the standard’. eL-diminutives systematically reveal 
the underlying identity of the stem-vowel. The child does not need to know 
Standard German to decide on the identity of the stem-vowel: s/he knows 
that [dg:gr] ‘container’ alternates with [de:geL] ‘small container’, and that 
[dy:B1] ‘dowel, bump’ alternates with [di:BeL] ‘small dowel, bump’. One 
can hypothesise that s/he will learn this rule quite early since diminutives are 
particularly widespread in baby talk. 


Notes 


1. This chapter is dedicated to Monik Charette whose inspired work on vowel harmony, elements, and 
licensing has shaped phonological thinking in and beyond the Government Phonology framework. 
I thank Connor Youngberg for inviting me to contribute to this book, and for valuable comments on 
this chapter. 

2. Abbreviations: INF = infinitive, COND = conditional, DIM = diminutive, IPF = imperfective, sBJ = 
subjunctive, N = noun, ADJ = adjective. 

3. In this chapter I use ‘/-lenition’ instead of ‘/-vocalisation’: as will become apparent, /I/ does not 
always surface as a vowel in the data under discussion. 

4. See Kaye et al. (1985, 308) and subsequent work for the marked character of the combination of |I| 
and |U]. 

5. Upper Austrian German does have [a], for example, [ba:m] ‘tree’. Interestingly the only [al] 
sequences I could find appear in loanwords, for example, [a:1] ‘eel’. Upper Austrian German [1] thus 
seems to be incompatible with preceding |A]. I get back to this observation in Note 8. 

6. Djabbari et al. (2010) note that ‘there also exists a vocalic variant of syllabic /I/ which occurs after 
labial consonants.’ However, they do not go into more detail concerning the distribution and the 
phonological effects of this ‘vocalic variant.’ 

7. See, for example, Kranzmayer (1956, 124), Hall (2009), and Noelliste (2017, 167) for the realisation 
of this segment in different Bavarian languages. 

8. ə~ a alternations are also observed in (i). However, they cannot straightforwardly be analysed as /a/ 
being rounded to [9] in the context of /I/, because they also obtain in the absence of /I/ (ii). 


noun diminutive 
(i) /-final [go:Br] [ga:BeL] ‘fork’ 


fopft]  [apfeL] ‘apple’ 
(ii) not/-final [ho:z] [ha:zeL] ‘rabbit’ 
[bo:x]  [baxxeL] ‘stream’ 


These alternations involve additional complications related to the status of |A| in Upper Austrian 
German. Standard German [a] and [al] regularly correspond to Upper Austrian German [o] and [ə1], 
for example, [dax] versus [dəx] ‘roof’, [bal] versus [bor] ‘ball’. However, Upper Austrian German 
does have [a], for example, [ba:m] ‘tree’. It thus seems to be the case that Upper Austrian German 
[I] is incompatible with preceding |A|. In addition, |A| resists the process of rounding at a distance 
triggered by /l/: dayn, *doyn ‘divide’.INF, kaSm / kamp1,*k3mpi ‘comb’ / ‘comb’.piM. Clearly, 
more research on the status of |A| in Upper Austrian German is needed. For various cross-linguistic 
generalisations on |A], cf. Péchtrager (2006). 
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9. In addition, both suffixes sometimes trigger stem-vowel shortening. This alternation is outside the 
scope of this chapter. For details and an analysis in terms of Lowenstamm (2008), cf. Bendjaballah 
(2008). 

10. If rounding is a phonologically active rule, it should not be restricted to /-final nouns: it should 
apply irrespectively of the morphological status of /1/, and we expect it to take place in (lexicalised) 
/-diminutives, too. Given the context during which this chapter was written, I could not conduct 
fieldwork and test this prediction. For a recent discussion of ‘vowel dissimilation’ in Bavarian, see, 
for example, Noelliste (2017, 146ff.). 

11. On licensing conditions, see among many others Harris (1990), Charette (1991), Cyran (2010). 

12. See Charette and Göksel (1996) for the analysis of various harmony processes in Turkic languages 
where |I| and |U| are involved in an asymmetrical fashion. 

13. See also Note 8. 

14. Some native speakers perceive [j] as a lower back vowel in this form, and would transcribe it as [e]. 
This is probably related to the peculiar status of |A| in Upper Austrian German, and its resistance to 
rounding, cf. Note 8. 
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ELEMENTS, GOVERNMENT, AND LICENSING 


4 
Sets of (sets of) elements 
Florian Breit 


Introduction 


In this chapter, I propose a revised formalisation of Element Theory (ET; 
Backley 2011; Charette and Göksel 1998; Cyran 1995; Harris and Lindsey 
1995; Kaye et al. (KLV) 1985; Lindsey and Harris 1990), building on the model 
in Breit (2013).! I propose revisions both to the structure building process and 
the architecture of an ‘element-theoretic grammar’, that is, a grammar of 
melodic representations (MRs).? This aligns the model more closely with a 
shift away from templatic structure and (asymmetric) pair-merge in minimalist 
syntax (cf. also Ulfsbjorninn 2021), forming a base-point to discuss cross- 
modular homology — an aspect that has long been a driving force behind 
developments in Government Phonology (Chapter 1 of this book; Kaye 1990; 
Kaye 2012; Péchtrager 2006; Pochtrager and Kaye 2013; cf. also Bromberger 
and Halle 1989). The architectural reframing also allows for a unified account 
of Licensing Constraints (LCs; Charette and Göksel 1996, 1998; Kaye 
2001), “parameterised’ elements (Cyran 1996) and (tentatively) elemental 
antagonism (Backley 2011, 2017). The chapter explores central aspects of the 
revised model, its correspondence with Standard ET and some variants, and 
discusses important questions and insights framed by formalisation. 


Element-theoretic assumptions 


There are many variants of ET, both historically and in current theory 
development. While this certainly makes an extended exposition of specific 
assumptions desirable, due to limited space I can but give an extremely 
terse summary of my assumptions here. I take these to describe what I term 
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‘Standard ET’, a (perhaps conservative-leaning) version I take to be represent- 
ative of the mainstream ET baseline over the last decade or so. Breit (2013, 
3—16) provides a more detailed technical exposition and Backley (2011) gives 
a comprehensive introduction to the theory (though I do not subscribe to 
multiple headedness and remain sceptical of admitting phonetic substance). 
Backley (2012) gives an overview of commonalities and differences across 
various version of ET. See also the overview in Chapter 2. 

Stenographically, I assume: (i) consonant-vowel harmony, that is, the 
same primitives are active in consonantal and vocalic representation (cf. 
Backley and Nasukawa 2010); (ii) the set of primitives (called elements) 
consists of |H, L, ?, A, I, Ul; (aii) elements are strictly privative and 
monovalent; (iv) elements are interpretable both independently and combined 
with other elements (Independent Interpretability Principle); (v) the empty 
representation | | is well-formed and interpretable; (vi) MRs have exactly one 
head or none (Single Optional Headedness Condition, SOHC); (vii) MRs are 
achiral, that is, they are distinguished solely by the elements occupying head 
and complement position. 


Preliminary considerations 


It is relatively obvious that the set of all possible MRs defined by Standard ET 
is finite, given that the terminal vocabulary is finite, repetition is not allowed 
(|A, A| = |A], or *|A, Al; pace Schane 1984), and MRs do not permit infinite 
embedding (recursion; pace Backley and Nasukawa, Chapter 22; Backley and 
Nasukawa 2020; Ulfsbjorninn 2021). 

The point is that MRs are rather simple compared to, for example, 
syntactic representations. This makes it relatively trivial, and tempting 
perhaps, to formally define the set of all possible MRs by equally simple 
means. The model I propose may thus at first seem unreasonably complex, 
but the reasoning is easily made apparent by briefly entertaining two (overly) 
simple approaches. 

The least complex but formally satisfactory definition of the set of 
possible MRs (that is, the formal language L(ET) defined by ET) can be 
given by simply enumerating all its members. Assuming the elements 
|H, L, ?, A, I, U| and short Kuratowski pair definition’ to represent head and 
complement (cf. Breit 2013, 21ff.), we might give L(ET) as shown in (1) 
(including only |A| and |I| for brevity): 


(1) LET) = {@, {A}, {A, {AS}, (D, {1 {D}, {A, I}, {A, {A,B}, 
{1, {A,B}, ... }. 
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The enumeration approach is obviously facile, being neither helpful nor 
insightful in any significant way. Apart from assigning some unique and 
templatically consistent set of representation to each MR defined by ET, we 
have learned nothing about the underlying cognitive system we are attempting 
to model (see also the criticism of templatism in Ulfsbjorninn 2021). 

Despite the relative brevity (|LCET)| = 256) ofan enumerated definition 
of L(ET), we could of course define a simple formal grammar Gpr = ( Vp Vy 
S,R) instead, with terminal vocabulary V, = {A,1,U,H,L,?}, non-terminal 
vocabulary V,, (consisting solely of the start symbol S), and a set of rewrite 
rules R, given cursorily by 


S 
(2) R=345S > anyX € (v) i 
any X € (Vz) > any {Y,X} such that Y € X 


Applying these rules to derive sets which do not contain non-terminal S from 
the set V,U V,, in the usual way, Gpp generates the language L( Gp) illustrated 
in (3) — which is of course identical to (1). 


(3) L(Gpr) = {@, tA}, {A, {AS}, 1B, {1 {D}, {A.T}, {A, {A,B}, 
{L {A, }},...}. 


This is conceptually more pleasing than enumeration because it makes the 
structure building process explicit: the first step always involves rewriting the 
start symbol with any set in so( V,), followed by an optional step of rewriting 
this set as one consisting of itself and any of its members (essentially internal 
merge). We can take G,,, as a model making claims as to the psychological 
reality of an intensional system, thus potentially furnishing actual insight 
about MRs. The model also allows us to engage in model comparison, asking, 
for example, how L( Gpr) would be affected if we modified V, or N in one way 
or another, thus being theoretically helpful. 

The way in which the second model falls short is in its content, that 
is, in the mechanism from which we have constructed Gpr which is not rich 
enough in detail and speciation to model the actually postulated underlying 
mental system. Consider for a moment the fact that |L| and |H| cannot combine 
in many languages. To encode this fact we would have to replace the very 
general rule S > any X € (V7), breaking it down in such a way that a 
specific language might show an absence of whatever rule allows the pairing 
{L, H}. We then consider that in yet many other languages, |U] and |I| cannot 
combine, that in certain languages such combinations are possible when one 
is the head, etc. etc. Eventually we must arrive at a system which breaks down 
the general patterns of these rewrite rules in such a way that we do not capture 
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a large proportion of the insights that are to be made about MRs but rather 
end up with something resembling the enumeration of possibilities rather 
than modelling the mental processes by which they are built. For instance, 
as discussed further in the section ‘The constraint set C’, the set of possible 
restrictions upon MRs itself is very much a system governed by rather few 
general yet well-constrained mechanisms. 


The revised model 


Given the discussion above, a conceptually more satisfactory and insightful 
model of ET must strive toward explanatory adequacy, that is, the aim must 
be to closely model the hypothesised mental mechanisms of ET and their 
integration within the cognitive architecture of the phonological component. 

Conceding that ET is but one part of a more complex phonological 
theory (whether it be Standard GP, Strict CV, or some other theory), often 
quite tightly integrated with other aspects of that theory, I propose that we 
can nonetheless fruitfully model it as a specific subcomponent of phonology. 
To stretch the computational metaphor, it seems apt to characterise ET as a 
relatively discrete submodule of phonology implementing processes to build 
and carry out computations over MRs, exposing a part of this functionality 
to its parent module (providing a ‘public API’ of sorts) while keeping certain 
parts of its workings private. That is, ET provides phonology with a system to 
(a) derive MRs from a symbolic vocabulary V, (b) relate and map MRs, and 
(c) exclude MRs of certain types. 

By way of an overview before diving into more detail, what I propose 
is that a grammar G modelling ET is a triplet (V,R,C) consisting of: (i) a 
universally fixed vocabulary of atomic symbols V (the set of elements), (ii) 
a set of relations R, also universally fixed, by which MRs can be derived 
from other MRs following specific principles, and (iii) a language-specific 
constraint set C, which delineates the set of well-formed MRs in L(G) by 
means of licensing constraints. Given such a grammar, we define È as the 
set of all MRs derivable from V by means of R (respecting the principles of 
application), and a language L as a subset of © such that L contains all the 
MRs derivable from V except those in C (that is, L(G) = X — Cg and L(G) = 
x e Co = Ø; note also that L(G) with Cg = Ø essentially = L(G,7) = 
L(ET) above). 

Note again that I assume that G characterises not just an output 
L(G), but models a subcomponent which other phonological processes can 
interact with (via the set of relations R). I also assume the absence of non- 
terminal symbols, that all relations map like-to-like (MR + MR) and that 
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the only language-specific part of the grammar are the constraints on MR 
composition encoded in C, which suggests that L(G) for any given language is 
(potentially much) greater than the set of MRs actually employed in its phono- 
logical representations. 


The vocabulary 


A major variable factor across versions of ET are the elements assumed. 
Apart from the labels we may attach to these atoms, the only significant way 
in which the set of vocabulary items affects the grammar is the generative 
capacity GC of G, since |GC(G)| « [Z| œ |V|. Independent of the specific 
elements assumed we state, 


(4) Given a grammar G = (V,R,C), let V be a finite set of atomic 
symbols. 


However, for each specific version of ET it is generally held that V is 
universal, that is, identical across human phonologies and not subject to 
language-specific variation. The generalised version in (4) is thus of interest 
if comparing versions of ET which assume differently sized sets of elements 
only. For all Gs implementing Standard ET, V is defined by 


(5) V = {A,LU,H,L,?}. 


A possible exception to the universality assumption is Cyran (1997, 2010). He 
proposes that some elements (specifically, the noise element |h|) may be para- 
metrically present or absent in some languages. While this can be implemented 
by varying V parametrically (as, for example, in Breit 2013), I propose that it 
is best to keep the assumption that the set of elements is a universally fixed 
part of the underlying cognitive machinery. As shown in this chapter’s section 
‘The constraint set C’, ‘parametric elements’ are already accounted for as part 
of the language-specific constraint mechanism. 


Melodic representations 


The primary objects that an element-theoretic subcomponent deals with are 
MRs, more than individual elements. As foreshadowed, I here espouse an 
explicitly derivational view. MRs are sets constructed from members of V 
according to a specific set of principles (or axioms). As Ulfsbjorninn (2021) 
rightly criticises, many recent approaches to ET take an essentially templatic 
approach where segmental representations are but a schematic structure with 
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slots that can be populated as needed. Not only does this miss potentially 
important insights (such as the hypothesis that an element in the head position 
of an MR may imply its phonetic interpretation as a complement; cf. Breit 
2017; Ulfsbjorninn 2021), it also prevents us from making observations about 
the proposed underlying cognitive processes. For instance, while Breit (2013, 
21) explicitly links the proposed structures to those produced by the syntactic 
adjunction operation (pair-)merge(a,f8) = {a, {a,B}} (Chomsky 1995), the 
derivational process is not further elucidated. 

I propose to revise Breit’s (2013) account in two important ways. First, 
I follow Ulfsbjorninn (2021) in assuming only set-merge as the underlying 
structure building operation (in line with other work proposing to eliminate 
pair-merge from syntax, cf., for example, Oseki, 2015). Second, I propose 
that relations in ET uniformly map MRs to MRs. The latter modification gives 
us the Independent Interpretability Principle for free, since bare elements are 
never exposed. 

Concretely, I propose that MRs are built by the (set-)merge operation, 
defined in (6). 


(6) merge(a.B) = {a,B}. 


Set-merge is one of the simplest possible cognitive operations, being not 
much more than ad-hoc categorisation. Of course, in syntax merge may apply 
recursively (potentially ad infinitum), whereas MRs are severely limited 
structurally. Per the SOHC, an MR in Standard ET consists minimally 
of a subset of V, and maximally of a subset of V plus one element in 
head position. If we want to capture this property of standard ET (rather 
than propose a new infinitely embeddable variant as in the Bare Element 
Geometry of Ulfsbjorninn 2021), an interesting question arises: If both 
syntactic and melodic representations are built by the same operation, how 
is it that syntax ends up with strictly binary branching trees (for example, 
{W, {X, Y}}) while MRs may be ‘imbalanced’ (for example, {X,Y,Z} or 
{W, 1X, Y,Z}})? 

It has been suggested that the binarity of merge in syntax is a domain- 
specific restriction, perhaps driven by computational requirements (for a brief 
discussion see, for example, Chomsky 2005, 11ff.), though n-ary merge might 
not be an insensible choice for ET. After all, ad-hoc categorisation is generally 
constrained more by factors such as subitisability and working memory than 
some need for symmetry. Even so, I propose to maintain the assumption that 
melodic structure building is based on binary (set-)merge. We need not allow 
an arbitrary number of objects to combine (for example, merge(a.,,...,4, 
Opni, ) to explain ‘imbalanced’ MRs. 
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There is an alternate candidate for the apparent imbalance in MRs, 
namely the numeration. The numeration v is a subset of items drawn from 
V at the start of a derivation, containing the items to be combined during the 
derivation. Since at least for ET, V consists only of atoms, we can say that v 
is a possible numeration precisely if v € (V), which gives us the corollary 
that all numerations in ET are MRs, including among others the empty set Ø 
and a finite number of sets of cardinality n such that n < |V| (maximally V 
itself). 

What I propose is that the numeration itself is essentially employed 
as first merge in the building of melodic structure. This fits well with a 
general oddity of first merge, namely that it is the only step in a syntactic 
derivation which applies external merge to more than one external item. 
A possible explanation for syntactic first merge not admitting the full 
numeration could be that this would not allow for other necessary conditions 
to be obtainable, for example, subcategorisation, feature checking, interface 
requirements, etc., upon second merge. Since an MR consists only of atoms, 
without feature-checking requirements or similar, the restriction does not 
arise. 


(7) (a) Given a grammar G = (V,R,C), let any v E€ (V) be called a 
numeration. 
(b) Every derivation in G must begin with a numeration. 
(c) Every numeration in G is a melodic representation (MR) in G. 


From (7), we get all the possible unheaded MRs, including the empty 
representation | |. Given that any subset of V is an MR, we should be able 
to apply merge to that MR. If we apply merge to a non-empty numeration 
and merge with some x € v (that is, we apply internal merge), we get an 
overall asymmetric, potentially ‘imbalanced’ structure. For example, let v = 
{H, ?, U}. If we stop the derivation here, we have the unheaded MR |H, ?, UJ]. 
If we now apply second merge (axiomatically always internal merge) on some 
x E v, say H, we have merge(H, {H,?,U}) = {H, {H,?,U}}, where H is 
what we conventionally call the head in a singly headed MR such as |H, ?, U| 
(a more formal characterisation of head and complement follows). Note that, if 
v = Ø, internal merge is not possible. Example (8) illustrates some 
correspondences between MRs in set- and conventional ET-notation. 


(8) | |=9, 
JA, I] = {A,1}, 
IA, Il = {L {A.D}, 
JA, L, 2] = {A, {A, L, 3}. 
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Much like Breit (2013) and Ulfsbjorninn (2021), this suggests that the basic 
mechanism of structure building underlying melody (vis-a-vis phonology) 
and syntax may show deep homology. The most striking and fundamental 
difference is that melodic structure building (at least under standard 
assumptions) is limited to second merge: 


(9) Given a numeration v in G, merge(a,f) may be applied iffa € £ and 
Bev. 


In other words, the set of all possible MRs © is exhausted by numeration 
followed by at most one application of internal merge. 

Before moving on, let us briefly consider whether weakening restriction 
(9) may be desirable. Explanatorily, the answer ought to be yes, (9) is 
stipulative, and one would hope that we could explain the restriction as arising 
from some third factor, for example, requirements at the interface to phonetic 
interpretation. Extensionally, under the most restrictive variant of standard 
ET, we cannot forgo it however, adding further to the impression of severely 
constrained structure building in this subcomponent. However, there is also 
a relatively well-established view in ET that some melodic expressions 
are ‘unfused’, argued to represent contour segments such as [tf] and light 
diphthongs. Breit (2017) suggests that these could be modelled by representa- 
tions where a higher primitive is not a member of first merge. In the terms 
of this model, that ‘unfused’ melody is external merge. Allowing one cycle 
of external merge (and excluding internal after external merge) would give 
us additional specifier-like structures such as {H, {I, {1,?}}} (= unfused 
|L 2|~|H]) and {I, {A} } (= unfused |A|~|I/). 


Head and complement 


The head-dependency relation is one of the most important aspects of ET 
representations, which matters not only for (prosodic) well-formedness and 
phonological processes, but also at phonetic interpretation. Let us first turn 
our attention to the head relation head C (È x #9(V)). As shown in the section 
above, the derived sets are all either of the form Ø, {a,...} or {a, {a,...}}. 
The former two are headless, the latter has a head a. head should thus map 
representations of the first two types to Ø, and the latter to {a} (recall that 
a ¢ X). This is actually (and perhaps surprisingly) very close to the effect 
of the labelling relation proposed in Chomsky (2008), which states that for a 
syntactic object x = {a,f}, a is the label of x if a is a lexical item and £ is an 
XP. The definition in (10) adapts this to ET: 
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(10) (a) A member a of an MR X is called the head of X iff a is a 
vocabulary item and at least one member of X contains a. 
(b) head(X) = {a E V: (a,b E X)A(a E P). 


Example (11) illustrates the relation (10b) for a selection of MRs: 


(11) head(Ø) = Ø, I ||| 
head({A,H,?}) = Ø, |A, H, ?| > | | 
head({A, {A,H,?}}) = {A}, |A, H, 2] > |A] 
head({A, {A}}) = {A}. |A| > |A| 


A perhaps more contentious question is what the right characterisation of 
the set of dependents should be. Kaye (2000, 2001), following Kaye et al. 
(1985), calls these the operators (the term dependents is also used) and means 
all the elements in a representation that are not the head. Breit (2013, 2017) 
and Ulfsbjorninn (2021) argue for a view of a (structural) complement which 
explicitly includes the embedded copy of the head. I will side with the latter 
view. Suffice it to say once we have a relation cmpl C (È x §o(V)) mapping 
to the complement of a representation, we can define the relation to the set 
of operators simply as ops(X) = cmpl(X) — head(X). The complement and 
its relation cmpl are defined (preliminarily*) as the set of vocabulary items 
(embedded) in the symmetric difference of X and its head, as in (12). 


(12) (a) A member a in a (potentially flattened) MR X is called a 
complement of X iff a is not a head of X. 
(b) cmpl(X) = {a € V: (a € (X © head(X))) v 
(a € BAB E (XO head(X)))}. 


Example (13) illustrates the relation (12b) for a selection of MRs: 


(13) cmpi(Ø) = Ø, Il>I] 
cmpl({A,H,?}) = {A,H, 2}, |A, H, ?| — |A, H, ?| 
cmpi({A, {A,H,?}}) = {A,H,?}, |A, H, 2| — |A, H, 2| 
cmpi({A, {A}}) = {A}. lA] > |A| 


A relatively interesting consequence of taking this more explicitly cyclic deri- 
vational approach based on symmetric merge is that head and cmpl now both 
range over (V), not È as in Breit (2013). As such these relations, important 
for specifying the conditions under which phonological processes take place, 
essentially implement syntactic demerge, argued by Fukui and Takano (1998) 
to play an important role in top-down linearisation. 
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Composition and decomposition 


Another important set of relations implements the composition and decom- 
position of primitives on existing representations. Following the arguments 
in Breit (2013, 28-31), I suggest these come in a complement- and a head- 
oriented flavour each, for example, compc, comph, decompc, decomph (all in 
(2, (V)) X X). Due to limited space, and because I have nothing much to add 
over Breit (2013), I will not further define or discuss the composition/decom- 
position operations here, though note that this proposed split fits rather well 
with the head and complement relations’ demerge-like character. 


The constraint set C 


I have argued that all the components of G discussed thus far (assuming of 
course a single version of ET) are universal. However, were this the case 
for the entire grammar, it would imply the claim that the range of generated/ 
generable MRs is identical across all languages. Were this so, the fact that 
the attested inventories of MRs employed frequently differs from language to 
language would have to be explained merely as a consequence of diachronic 
happenchance. That is to say, the reason that for instance English makes use 
of |A, U| but not *|I, U|, while Finnish employs |I, U| but not *|A, U| is simply 
a consequence of the way in which their phonetic surface forms and lexical 
items have been shaped over time, in each case simply implementing the 
optimal encoding according to some shared algorithm given as its input the 
experienced phonetic surface forms of a sizeable proportion of the respective 
languages’ lexicon. Consequently, an English speaker would be free at any 
time to posit a new underlying representation containing |I, U| if that were the 
optimal representation for what they needed to represent, as would a Finnish 
speaker with |A, UJ. 

Albeit perhaps counter to the intuitions or inclinations of many working 
in the ‘maximally restrictive’ tradition of GP, I propose that we should actually 
adopt an only slightly more constrained version of this hypothesis. What the 
above formulation is failing to account for is that some languages provide 
hard evidence (that is, evidence within their respective phonological system) 
that certain configurations of elements are not permitted, or actively ruled 
out, by their phonology. In ET, specific MRs are ruled out through Licensing 
Constraints (LCs; Charette and Göksel 1996, 1998; Kaye 1993, 2001; 
Ploch 1999). While there is still much work to be done on LCs, especially 
in consonants, it has become clear by now that these are highly systematic. 
Example (14) attempts to give an exhaustive list of the types of LCs employed 
in ET, where a, E€ V,a + f, and operator means a E€ cmpl(X) — head(X): 
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(14) (a) All MRs must be headed. 
(b) a may not license operators. 
(c) a must license operators. 
(d) a must be licensed. 
(e) a may not be licensed. 
(£) a may not be head. 
(g) a must be head. 
(h) a may not license 2. 
(i) a and f} may not combine. 


Using such LCs, the English—Finnish situation above is usually accounted for 
by assuming that in English |I| and |U| may not combine (141), while in Finnish 
|U| may not be licensed (14e). 

What I propose is that the constraint set C is essentially a set of 
predicates, limited to a very specific form, which taken together delineate 
a subset K of È, functioning as a filter on what G can generate with R over 
V (note that C does not properly partition È since potentially K € {@,=}). 
Importantly, LCs do not interfere directly with the structure building process 
(for example, blocking a certain numeration). They instead place a limit on the 
eventual output condition of a derivational chain, meaning that intermediate 
steps in a derivation may violate some LC, but the final derived object may 
not. Based on this we can define the specific set of MRs L(G) generated by G 
as follows: 


(15) A grammar G = (V,R,C) generates an MRS € Lif there is a 
sequence of MRs S,,...,S, such that S, € go(V), S; is derivable from 

S,_, by a relation in R foreach2 < i < n,andS, ¢ K. 

LCs can be systematised as predicates, each of which selects a subset of £, so 
that K is the infinitary union of those sets (since we have construed L(G) as 
CK). Table 4.1 shows the LCs with proposed predicate forms. Stated in this 
way it becomes immediately apparent that there is a very restrictive definition 
of what is a well-formed constraint predicate, promising further reduction and 
generalisability of LCs. 

Though I am optimistic that LCs can be successfully generalised even 
further going forward, for now it at least seems clear that we need at most three 
general forms: 

First, (a-c) are about head-complement (non)-identity. Their range can 
be captured with two variables P,A with domains {=, 4} and {4: A C VA 
|A| < 1}, so that we can generate their predicates with a function conident(P, A) 
œ> A = head(S) ^ A P cmpl(S). LC (a°) does not form part of the established 
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Table 4.1 Licensing constraints reframed as a complement in È. 


Condition As C, — {S € 2:...} 


(a) All MRs must be headed { head(S) A { } # cmpl(S) 
(a’) (no empty representation) { } = head(S) A { } cmpl(S) 


(b) amay not license operators {a} = head(S) A {a} # cmpl(S) 
(c) a must license operators* {a} = head(S) ^ {a} # cmpl(S) 
(d) a must be licensed { } = head(S) A { } © cmpl(S) 
(e) a may not be licensed { } # head(S) A { } © cmpl(S) 
(f) a may not be head {a} = head(S) A {a} © cmpl(S) 
(g) a must be head {at # head(S) A {a} © cmpl(S) 
(h) amay not license £ {a} # head(S) ^ {B} C cmpl(S) 
(h’) (only a may license £) {a} # head(S) A {p} C cmplS) 
(i) aand £ may not combine {a} C cmpl(S) A {Bt © cmpl(S) 


t in conjunction with (g). 


LC canon, but is implied by this generalisation, and actually reflected in 
a known property, namely that in some languages (non-p-licensed) empty 
skeletal positions are not permitted. 

Second, (d-h) are about head-complement distribution. Looking over 
these constraints, we notice that they imply (h’) — again not part of the 
established canon — and that LCs (d-g) are special cases of (h,h’), with the 
first set Ø in (d,e), and both sets being identical in (f,g). Three variables P,A,B 
are required to capture the range of distributional constraints. P,A have the 
same domain as with conident, and B has the domain {B:B C VA|B| = 1} 
(that is, the domain of A without Ø). Thus condist(P,A,B) œ> A P head(S) 
AB C cmpl(S) captures the range of these LCs. 

Third, (i) implements co-occurrence constraints and is set apart 
principally by not enforcing any condition on the head. Assuming two 
variables A,B again with the domains already given we can generate their 
predicates by conco(A,B) => A C cmpl(S)A B C cmpl(S). 

As mentioned in the section ‘The vocabulary’, this constraint mechanism 
readily captures parameterisation of individual elements (Cyran 1996, 2010). 
Givenanelementx € V, xis totally excluded from L(G) if {condist(=,@, {x}), 
condist(#,O , {x})} © Cg, that is, ifx simultaneously must be licensed (d) and 
may not be licensed (e). In fact, if we assume that the domain of the variable 
A of conco permits the empty set (as suggested above), excluding an element 
x may be even more trivial via conco(Q, {x}). Since Ø is a subset of any set 
by definition, Ø C cmpl(S) is always true and conco(Q, {x}) equivalent to 
{x} C cmpl(S). 

Elemental antagonism (Backley 2011, 2017) attempts to capture 
certain postulated combinatory restrictions of pairs of elements that share 
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substantive traits. For instance, the antagonist hypothesis argues that because 
|L| and |H| both correlate with frequency modulation, their combination 
is marked and if they combine the compound must be asymmetric (that 
is, |L, H| or |L, H| but not *|L, H| or *|L, H|). A principal motivation is to 
restrain multiple headedness to 4 (or three heads assuming |V| = 6; Backley 
2017, 11). While the model I have discussed above assumes single headedness 
(the SOHC), if it is modified to allow ‘complex’ (aka multiple) heads, it seems 
reasonable in turn to also obviate the cardinality restriction on variables in 
the constraint predicates, that is, allowing sets with cardinality greater than 
1 for A,B in conident, condist, and conco. This would allow constraints 
such as conident(=, {x,y}) to rule out the double-headed expression |x, y|, in 
conjunction with condist(=,@ , {x,y}) to rule out the headless antagonistic pair 
Ix, y|. However, even though this seems feasible when considering just two 
elements in isolation, as soon as we consider |V| > 2 it becomes clear that 
there is no straightforward means to generally rule out a symmetric antago- 
nistic pair in compounds with further elements (for example, capturing |x, y, z|, 
IX, y, wl, ... without explicit enumeration of |z, w, ...|). This makes clear that 
while the antagonistic relation proposed by Backley seems to fit well into the 
broad category of LCs, the multiheaded version of ET cannot be sufficiently 
captured purely by weakening the model’s cardinality restrictions. Instead, the 
nature, and possibly number, of constraint mechanisms required appears to 
also be more complex. 


Conclusion 


In this chapter I have proposed several revisions to the formal model of ET 
proposed in Breit (2013), making the model both more comprehensive and 
simpler in several respects. 

As pointed out by Ulfsbjorninn (2021) and argued in this chapter, 
templatism doesn’t appear to align well with the research agenda of Government 
Phonology and ET, which among other things have always sought to explore 
cognitive homology between (morpho-)syntax and phonology and to reduce 
where possible that which is supposed to be ‘special’ about phonology 
vis-a-vis related cognitive faculties. It has been proposed that the structure 
building process underlying both syntax and melodic phonology may be 
essentially the same, namely simple symmetric set-merge. While there is much 
work in the GP/ET sphere that similarly explores more homologous structure 
building (for example, Péchtrager and Kaye 2013 for GP 2.0; Nasukawa 2014 
for Precedence-free Phonology; Cavirani and van Oostendorp 2020 for vowel- 
internal recursion; Ulfsbjorninn 2021 for Bare Element Geometry; see also 
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Chapters 21-23) these all depart quite radically from the most well-established 
mainstream of ET in several ways. What is novel in the specific context given 
here, is that it has been shown that, even within the highly restrictive, non- 
recursive, single-headed standard implementation of ET, there is potentially 
much to be gained not only in elucidating the actual underlying mechanisms 
of melodic structure building (versus focusing on just the resultant represen- 
tations) but in learning what may give rise to observed and/or hypothesised 
differences of the processes involved. For example, it has been suggested here 
that the fairly different structures derived by melody versus syntax may result 
from factors not directly attributable to the underlying cognitive process itself, 
but for instance: to a lack of feature-checking and subcategorisation in melody 
(in turn the result of ET’s monovalent atomic primitives) which permits the 
entire numeration to feature as an input to merge; to a more strictly fixed 
sequence of derivational cycles (possibly linked to the specific processing 
needs of phonology, which must yield very rapid and robust output to 
facilitate fast lexical access); and to the different interface-requirements faced 
by the two modules. It certainly seems worthwhile to explore these factors 
much further as they clearly have the potential to arrive at better explanations 
regarding melody-specific assumptions. 

The other main revision proposed concerns the integration of licensing 
constraints within the model of melody. What I have proposed is that LCs 
operate as an output filter on the structures generable by the ET grammar, 
rather than operating either during derivation of those structures or at the level 
of syllabic/prosodic phonology. It has been proposed that all LCs in ET can 
essentially be reduced to three types, constraining head-complement identity, 
head-complement distribution, and co-occurrence respectively. It may be 
possible that further work on LCs can lead to even better generalisations over 
the set of possible LCs. Integrating LCs at this stage of the grammar offers 
several interesting questions to pursue further, for instance to what degree there 
is evidence that melodic representations not employed by a given language (in 
the absence of positive evidence for a learner to posit an LC) are available to 
speakers of that language (for example, in representing the melody of loans). 
Another interesting aspect concerns generative capacity: it has often been seen 
as an argument for ET that it predicts much smaller segment inventories and 
fewer languages, closer to what is attested than say SPE-style feature theories 
(cf. Breit 2013, and references therein), though it has been pointed out that 
this may not be the true metric we might want to apply concerning melody 
(Reiss 2012). ET with integrated LCs offers a different mechanism by which 
segment inventories are generated and it is quite likely that many different 
constraint sets give rise to the same final inventory, so that further investiga- 
tion on possible constraint sets in ET offers novel measures on the predicted 
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sizes and distributions of various melodic systems rather than just two flat 
measures of possible segments and possible inventories as in previous work 
on the generative capacity of ET. 

Extending the application of the LC mechanism to elemental antagonism 
in multiheaded ET revealed that while, on the one hand, the implementation of 
the antagonism hypothesis itself may not necessitate changes and additions to 
the structural assumptions about ET beyond allowing multiple heads (contra 
to what Backley 2017 seems to suggest), on the other it seems clear that 
either the specific consequences of antagonism (especially with respect to the 
unheaded occurrence of both elements in an antagonistic pair) or the species 
of LCs required in multiheaded ET must be revisited, potentially pointing to 
another aspect in which that version of ET is less parsimonious than single- 
headed ET. 


Notes 


1. I thank Andrew Nevins, Connor Youngberg, Kuniya Nasukawa, Markus Péchtrager, Shanti 
Ulfsbjorninn, Sixto Rodriguez, Tobias Scheer, and especially John Harris and Monik Charette, for 
encouragement and useful discussions on some of these ideas. 

2. MRs are also commonly referred to as phonological expressions (PEs) or (sub-)segmental 
representations. I use the term MR here to avoid ambiguity (expression versus representation; 
phonology versus melody; ‘SR’ could refer to ‘surface representation’). 

3. Kuratowski (1921) proposes (x,y) = { {x}, (x,y}}, often adapted to ‘short Kuratowski’ {x, ¢x,y}} (for 
example, pair-merge in Chomsky 1995). 

4. The simple definition in (12) hinges on the limited, single merge cycle, which does not allow structures 
of the form {y, {a,8,...}} (where y ¢ {a,}). If we want to incorporate such structures (for instance 
to cover Bare Element Geometry, or unfused melodic expressions, the definition of the cmp/ relation 
needs to be revised. Note also the clumsy disjunction to extract embedded members if head(X) # @, 
a consequence of the mixed member types in short Kuratowski pair notation {a, {a,}} (infinitary 
union resolves this more elegantly with { {a}, {a,f}}). 
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Production bias and substance-free 
representation of laryngeal 
distinctions 


Eugeniusz Cyran 


Introduction 


A rather marginal yet intriguing phenomenon of voicing retention occurs in 
Polish in the phonological context for final obstruent devoicing (FOD). It has 
a number of sources: phonetic, pragmatic, and possibly also phonological. 
It may be argued that it is similar in nature to the problem of incomplete 
neutralisation. One well-known example of voicing retention, which is 
clearly motivated by pragmatics, arises to disambiguate the message, avoid 
homophony, or even rudeness, as in dób [dup] ‘day and night, gen.pl.’ and 
kod [kot] ‘code’, which are homophonous with dup [dup] ‘arse, gen.pl.’ 
and kot [kot] ‘cat’. Polish speakers consciously avoid the homophony by 
pronouncing the former two as [dub] and [kod] (Gussmann 2007). Whatever 
explanation is given to account for such variation occurring in monitored 
speech, modularity suggests that pragmatics, or even phonetics should have no 
access to phonology. However, the question remains as to how it is possible 
to bypass regular phonology when voicing is retained. It will be argued that 
such phenomena provide us with a number of analytically relevant tips as to 
how sound systems work. 


Theoretical context 


It is generally assumed that the linguistic utterances we produce are externali- 
sations of phonological forms, or result from phonological derivation. Given 
a system in which the phonology has a process of devoicing, voicing retention 
suggests that the ‘rule’ of devoicing is blocked, or that, for some reason, what 
is pronounced is based directly on the underlying form, which amounts to the 
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same thing. Thus, for phonology, if this is where the phenomenon is rooted, 
the question is two-fold: (i) how retention can be formally captured, and 
(ii) what is the nature of the interaction between phonology on the one hand, 
and the extra-linguistic aspects listed above. 

Phonology is responsible for the nature and distribution of the laryngeal 
categories in the phonological representation. In this chapter, the represen- 
tation of the laryngeal distinction is assumed to be strictly privative. In a 
language with two series of obstruents only one is marked, while sonorant 
consonants and vowels are never marked laryngeally. These tenets are widely 
accepted within, for example, Element Theory (Backley 2011; Harris and 
Lindsey 1995) and more generally within Laryngeal Realism (Beckman et al. 
2013; Harris 1994; Honeybone 2002; Iverson and Salmons 1995), according 
to which, Polish, which is a ‘voice’ system, marks the voiced series of 
obstruents, for example, /p°/ versus /b™/. 

The second aspect of phonological conditioning, which underlies 
the distribution of the laryngeal prime in phonological representation, is 
the mechanism of licensing (Charette 1990). In Polish, as in other Slavic 
languages, the laryngeal licensing (LarLic) is discharged by vowels. The 
licensing goes from the melodically filled nucleus to the preceding obstruent, 
with or without an intervening sonorant (Cyran 2014). If the nucleus is 
empty, the laryngeal licensing fails. Schematically, we may therefore refer 
to the following two types of strings: licensing C'"(R)V and unlicensing 
C°(R)O. 

To answer the question where the phenomena of voicing retention 
belong a particular view of sound system is assumed in this chapter. It 
comprises the phonological module with its representation and computation 
on the one hand, and language specific phonetics (LSP), which is the totality 
of phonetic knowledge concerning a given language, and universal phonetics 
(UP) on the other. The two domains of the sound system are mediated by 
largely arbitrary, that is, lexicalised spell-out relations (Scheer 2014). 


(1) Sound System 
Phonology <—— spell-out ——-» LSP+UP 


A distinction should be made between sound systems and sound patterns. The 
latter is understood as the totality of phonetic, surface, effects and knowledge 
that speakers are consciously aware of, and linguists collect as data. In other 
words, a sound pattern is synonymous with LSP, and merely constitutes a 
fragment of a given sound system. 

Below, we consider each of the above aspects of the laryngeal sound 
system beginning with a possible phonological analysis of voicing retention. 
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Hypothesis I: a phonological approach in which 
delaryngealisation is blocked 


Under this hypothesis devoicing is due to delaryngealisation. Voicing 
retention, on the other hand, is a case of the phonetic interpretation of the 
underlying representation, rather than the post-phonological form. Given that 
the distribution of the laryngeal distinctions in Polish is regulated by licensing, 
one way of capturing register switches, or hyper-correction, is to refer to 
strengthening of the licensing properties of nuclei. 

An ad hoc phonological analysis to this effect is offered in Cyran (2014). 
It builds on the observation that languages seem to employ slightly different 
licensing properties of different types of nuclei depending on the register. One 
example comes from Malayalam in which the final empty nucleus is allowed 
to license more types of consonants in formal than in colloquial speech, 
producing variation of the type [paal ~ paalo] ‘milk’, [wayar ~ wayaro] 
‘stomach’. Malayalam resembles Polish voicing retention in that the final 
empty nucleus can license more in formal speech than in colloquial/unguarded 
speech. When voicing is retained in kod and dob, it appears that Polish 
modifies the target of licensing rather than the trigger: there is no observable 
change in the licensor. However, phonologically, we could claim that the 
modification is in fact in the trigger: it is given more licensing power in 
monitored speech, than it generally has. The analysis of voice retention in kod 
[kot ~ kod] ‘code’ is illustrated below in (2). 


(2) Unguarded speech [kot] Monitored speech [kod] 
yt = 
a C V C V — >b C VC V 
| | | LarLic | | | 
kad strengthening k o id 
=+ | 


Generally, the idea is that there is a critical point at which a given type of 
licensor can barely license the preceding structure. This is where variation 
connected with register switches occurs in Malayalam and possibly Polish. 
There are a few theoretical consequences of the above proposal. The biggest 
one is that grammar-external considerations influence the computation of 
the phonological module. This is at odds with the modular architecture of 
grammar. 
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Hypothesis II: there is no phonological 
delaryngealisation ever 


An alternative assumption to the one in which retention is a case of delaryn- 
gealisation blocking could be that the actual computational phenomenon of 
phonological delaryngealisation simply does not exist. As a consequence the 
existence of laryngeal licensing becomes superfluous. 

The idea that there is no delaryngealisation is explicitly proposed 
by van der Hulst (2015). It is claimed that the property called |fortis| is 
present in voiceless obstruents in both aspiration and voicing languages. 
The difference between such systems follows from language specific 
implementation enhancement rules. In some adverse contexts, for example, 
word-finally, such rules do not operate leaving a phonological distinction 
which is deprived of phonetic enhancement, but no |fortis|-loss occurs. The 
remaining prime is claimed to be responsible for the small distinctions 
between the lexically voiced and lexically voiceless obstruents which are 
known as incomplete neutralisation (for example, Slowiaczek and Dinnsen 
1985). 

Thus, the attraction of the no-delaryngealisation hypothesis is obvious. 
It is a viable phonological response to the growing evidence for the incom- 
pleteness of neutralisation. As a consequence of this proposal, the explanation 
for FOD, and indeed voice retention phenomena, rests completely outside 
phonology. The obvious candidates that are left are the spell-out and the 
phonetics (LSP+UP). 

Apart from the fact that laryngeal licensing becomes redundant, virtually 
all computation to do with |Lar| vanishes, possibly, including |Lar|-spreading. 
Thus, what is left in the phonology is merely the lexicalised distinction 
which is responsible for the voiced/voiceless contrast, or, in fact, the lexical 
distinction which will determine which phonological objects undergo which 
battery of phonetic enhancement rules in relevant contexts. Without the 
enhancement rules, the distinction between the lexical representations may 
only produce very similar objects in a particular context. 

Let us consider where exactly the implementation rules reside in our 
tripartite model of the sound system. Is it in the spell-out, or in LSP? 


The nature of spell-out 


Within Element Theory and Government Phonology (Charette 1990; Harris 
1994; Harris and Lindsey 1995; Kaye et al. (KLV) 1990), phonetic interpreta- 
tion of phonological representations seems to be construed mainly as interpre- 
tation of segments based on prior phonological processing due to the prosodic 
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context, rather than interpretation of segments in context.! Thus, given that 
[p] and [b] in Polish are /U,h,?/ and /U,h,?,L/, respectively, FOD is a case of 
phonological lenition under weak licensing (for example, Harris 1990, 1994), 
whereby a segment /C!/ is turned into /C°/. The following empty nucleus 
itself is not taken into account in the interpretation. The prosodic context 
is assumed to have modified the internal representation of a given segment 
before it is phonetically interpreted. This view is echoed in recent proposals 
(for example, Scheer 2014) in which spell-out (<>) is understood as lexicalised 
lists of segment types, for example, /X/ <> [b] versus /Y/ < [p], rather than 
strings of the type /X@/ + [p] versus /XV/ + [b]. 

On the other hand, in van der Hulst (2015) the context is crucial for the 
interpretation because it triggers particular enhancement rules, or not. The 
question is, if it is the phonological or the phonetic context that must be taken 
into account. The answer should probably be as follows: phonetic, because 
the enhancement rules are in the phonetics, or at least outside phonology. 
Thus, under the non-delaryngealisation assumption, there is almost no 
phonological computation in FOD. The phonological object makes it through 
the phonological processing unscathed, and is phonetically interpreted 
depending on the phonetic context. It may, therefore, be claimed that if there 
is any computation it occurs in the phonetic domain. There is no phonological 
allophony, but only a phonetic one. Consequently, what begins to loom large 
is the question of the nature of the computation at the level of phonetics, that 
is in LSP.? 

Both hypotheses discussed above seem to rely on the same premise: 
that there is no delaryngealisation. However, they differ markedly with 
respect to the phenomenon of voice retention in Polish. The phonological 
analysis of retention in the section of this chapter ‘Hypothesis I: A phono- 


> 


logical approach ...’ assumes the absence of delaryngealisation only in the 
case of the interaction between phonology and pragmatics. In this approach, 
FOD, when it does occur, is a result of phonological computation. In the 
‘no-delaryngealisation-ever’ approach, FOD is a result of the absence of 
phonological and indeed phonetic computation. FOD follows from the 
absence of enhancement rules in a given context. 

Since in van der Hulst (2015) it is the voiceless series that carries the 
property |fortis| and the voiced series is only voiced due to the enhancement 
rules, it may be assumed that in the case of Polish voicing retention in kod 
the rules of voice enhancement are simply generalised to all positions. The 
problem of modularity does not arise, because LSP may be viewed as the 
domain of grammar-external knowledge. That is, one that is not a result of 
phonological acquisition, but rather the basis for phonological acquisition. 
This is where the knowledge of language specific phonetics, that is, the 
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knowledge of major allophones, alternations, and other observable sound 
patterns interact with the knowledge of spelling, pragmatics, etc. 

To sum up, the ‘no-delaryngealisation-ever’ hypothesis leads to a 
number of positive outcomes. It allows us to automatically place phenomena 
such as voice retention in the domain in which it guarantees that modularity is 
not undermined. It also offers a surprisingly simple explanation of incomplete 
neutralisation. This assumption, however, also has some grave consequences 
for phonology. One of them is that almost no phonological computation is 
left to describe or explain the sound patterns related to voicing phenomena. 
Another interesting outcome of such a strong component of enhancement 
rules within LSP is that it seems no longer linguistically relevant which 
obstruent should be marked in a two-way system. The only thing that is 
important is that one of them is. The rest is due to the relevant enhancement 
rules. 

Below, we consider an assumption which combines some elements of 
both approaches discussed above. 


Hypothesis III: Polish has phonological delaryngealisation, 
voice retention is an LSP phenomenon, it does matter 
which series of obstruents is marked, enhancement has 
something to do with articulatory planning, less so with 
computation 


There is evidence that FOD is at least partly phonological in Polish — as 
delaryngealisation due to the absence of laryngeal licensing — and that voicing 
retention, just as incomplete neutralisation, is entirely restricted to LSP. There 
is also evidence that it matters which series of obstruents is marked. The 
evidence comes from sandhi phenomena. 

Depending on whether word-boundary is visible to the application 
of enhancement rules, the ‘no-delaryngealisation-ever’ assumption predicts 
two sandhi patterns, of which one possibly corresponds with Macedonian 
(Korytowska 2012), and the other with the Warsaw Polish (WP) dialect. What 
is not predicted is the existence of Pattern 3, which is found in Cracow-Poznan 
Polish (CPP). All the patterns involve FOD in final position, unless retention 
occurs (Pattern 4). 
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(3) ‘no-delaryngealisation-ever’, boundary visibility and retention 


# boundary visibility voicing retention 
No Yes as lexical identity 
Pattern 1 Pattern 2 Pattern 3 Pattern 4 
Macedonian WP CPP Mac., WP, CPP 


[Di —>[T] DH  3[T] D# —[T] D# > [D] 
/D/#/V/— [D] /D/#/V/—> [T] /D/#/V/— [D] /D/#/V/—> [D] 
ITNI 3 [T] /T/#/V/ > [T] /THV/ = [D] /TH/V/ > [T] 


Pattern 1 indicates that enhancement rules do not apply word-finally, hence 
FOD takes place. The lexical identity is maintained across a word boundary, 
which means that the enhancement rules do see the following vowel and apply, 
ignoring the intervening boundary. This alone suggests surface (LSP) condi- 
tioning of the enhancement rules. The other pattern which is consistent with the 
‘no-delaryngealisation-ever’ hypothesis is Pattern 2. This time, however, one 
has to assume that the word boundary is visible and blocks the enhancement 
rules, suggesting that either the enhancement rules must operate on deeper 
representation, or that the underlying distinction between /D/ and /T/ is no 
longer present, for example, due to delaryngealisation. The problem concerning 
the level of application of the enhancement rules is further complicated by 
Pattern 3. It partly corresponds to Pattern 1, in that FOD occurs word-finally, 
and /D/ is enhanced before a vowel-initial word, suggesting boundary invis- 
ibility. However, it would not be correct to say that the word boundary is 
ignored because the lexical /T/ is enhanced, as if it was a /D/. Enhancement, 
as the name suggests, should amplify the lexical distinction, not neutralise it. 
Therefore, one should conclude that in Pattern 3 the boundary is visible and it 
provides a phonological context for the loss of the laryngeal distinction. On 
the other hand, enhancement operates at the phonetic level and is blind to the 
boundaries. Finally, Pattern 4 is found in all the example languages as a case 
of voicing retention which ignores both boundaries and enhancement contexts. 

As argued in Cyran (2014), the distinction between Patterns 2 and 3 
can be understood within a strictly privative account only if it is assumed that 
(i) word-final obstruents are delaryngealised in both dialects; (ii) the laryngeal 
representation of the obstruents in CPP is the opposite of WP, that is, WP 
has /D'*"/ = [d] versus /T°/ = [t] and CPP has /D°/ = [d] versus /T™"/ = [t], 
and (iii) there is no difference between word-internal C°V and the sandhi 
C°#V contexts with respect to enhancement of /C°/. Whether it is phonetically 
interpreted as voiced or voiceless is strictly related to the regular spell-out 
relations established for the unmarked obstruent in the respective systems 
during acquisition. 
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Production bias and acquisitional amnesia 


First of all, the laryngeal relativism described above rejects the realist view 
that /C°/ has a universal phonetic interpretation as voiceless unaspirated. 
Likewise, |Lar| corresponds to neither voicing nor aspiration. Its choice does 
not a priori determine the surface phonetic categories, that is, whether it is a 
voicing system [b-p], or an aspiration one [p-p"]. The deterministic approach 
to laryngeal marking will be viewed here as an instance of production bias. 
We assume that the choice of the phonetic categories, for example, [b-p], is an 
LSP choice, based on the [b-p-p"] distinction provided by UP (4a). 

The acquisition of the laryngeal system proceeds from LSP, where 
languages divide into ‘voicing’ and ‘aspiration’ systems (4c). 


(4) Acquisition of the laryngeal system 


phonetic distance 


a. UP [bj [p]------- [p"] 
b. LSP choice 7 i K 
c. ‘voicing’ language [b]------- [p] [p] pannes [p"] ‘aspiration’ 
2, ¥. As language 
d. systemic choice “Ge, h & $ 
ER 
e. phonology |Lar| 


By the time the acquisitional decision is made concerning the way the relevant 
two-way distinction is to be represented in the phonology (4d), the LSP 
knowledge already comprises a number of aspects which constitute the basis 
of this systemic choice. The decision is an automatic act of establishing the 
spell-out relations between the surface phonetic object representing a totality 
of acoustic, perceptual, and articulatory aspects and the symbolic privative 
representation of the laryngeal distinction. 

LSP involves knowing the main distinctive phonetic categories/major 
allophones, for example, [b, p], that they are fully distinctive as targets of 
spell-out in the context _(R)V, and that they are related to particular modified 
forms in other contexts than _(R)V. These sound patterns constitute the basis 
for the systemic choice/acquisition of the phonological representation and 
computation that generalise the patterns in the mind. 
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(5) LSP patterns 
a. /B/ > [b] /_ (R)V 
/B/ — [b] /__Ctvo and _#(R)V in CPP 
/B/ — [p] /__#,._C-Y% and __#(R)V in WP 
b. /P/ > [p] /__(R)V 
/P/ — [p] /__#,__ C and __#(R)V in WP 
/P/ => [b] /__CHei and _ #(R)V in CPP 


LSP has two important properties: (i) it constitutes the basis of phonological 
acquisition, and (ii) this knowledge does not disappear after phonological 
acquisition is complete, and, together with UP, plays a role in phonetic 
implementation. 

The representational choice has to reflect the patterns in (5), but it 
need not be a direct translation of phonetic patterns into phonological rules, 
or a direct translation of acoustic properties into phonological features. 
Importantly, it is not a phonetic decision. The choice of the marked series will 
have no bearing on the phonetic categories in (4c), because they are already 
given: the input to acquisition will later constitute the output of production. 
What matters in the representational decision is the phonological behaviour 
of the marked segments, for example, that they require licensing. It follows 
that the property |Lar| can indeed be fully symbolic and need not contain 
substance and the interpretation of the neutral obstruent is system specific. 
Thus, in principle, there is nothing wrong with the fact that /C°/ is phonetically 
interpreted as [p] in one system and [b] in another. 

As for phonetic interpretation, we may assume that the target of, say, /b/ 
at spell-out is the [b] which is found in the _ (R)V context. It may be 
somewhat idealised as a gross acoustic pattern (Harris and Lindsey 1995). 
However, spell-out does not consider the phonetic context. The context will 
have an effect on the final phonetic interpretation of the target. Thus, spell-out, 
or translation, may be viewed as slightly different from phonetic interpreta- 
tion. The former is automatic and arbitrary, while phonetic interpretation 
involves contextual influence. If so, then enhancement rules may be viewed as 
an effect of articulatory planning, rather than computation of any sort. 

Finally, the knowledge of LSP is not forgotten. It is the basis for the 
phonological acquisition and remains as the speakers’ conscious competence. 
This fact can be used in explaining the phenomenon of voicing retention, as 
well as, it is hoped, incomplete neutralisation. 
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Voice retention, an LSP take 


It should be recalled that what FOD, voicing retention, and incomplete 
neutralisation have in common is the word-final context, in which the LSP 
knowledge, for example, that illustrated in (5), involves not only the awareness 
that obstruents must be voiceless, but also the fact that they have two lexical 
sources: a voiceless obstruent, or a voiced one. This is because at this level 
of linguistic knowledge speakers know the alternations and they are able to 
identify the major allophones. This is also where other grammar-external 
aspects interact, such as experience, pragmatics, knowledge of spelling, word 
familiarity, etc. It is in LSP that speakers can consciously manipulate phonetic 
forms on the basis of the LSP patterns and various relations, and this is where 
voice retention should belong. 


Incomplete neutralisation 


Since the early 80s, experimental studies have been showing that the 
neutralisation of the voicing contrast in languages like Polish is incomplete, 
and that it requires a phonological account (for example, Slowiaczek and 
Dinnsen 1985). In the light of our discussion of the laryngeal system, the idea 
that incomplete neutralisation should be expressed in the phonology is based 
on two misconceptions: production bias and acquisitional amnesia. This 
view wrongly assumes that phonetic forms (sound patterns) are generated 
by the phonological system and implementation rules, with no reference 
to the crucial aspect of sound systems, that is, LSP. The domain in which 
conscious awareness of the patterns prevails. It is in LSP that the variants 
are controlled by such aspects as word familiarity, frequency, tempo of 
speech, degree of speech monitoring by speakers, spelling, pragmatics, 
and more importantly, also the familiarity with the sounds patterns, such as 
alternations. 


Conclusion 


This chapter addressed the question of the structure of the sound system in 
relation to familiar though marginal laryngeal phenomena in Polish. The 
analysis of voicing retention in Polish, which is a case of suspension of a 
regular phenomenon of final obstruent devoicing, allowed us to clarify the 
role of phonology, spell-out conventions, and language specific phonetics in 
sound systems. The acquisition-centred perspective affords a view in which 
phonological categories may be deprived of substance, and phonological 
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computation is rather small. It is restricted to licensing or unlicensing of the 
category |Lar| in the phonology. 

A simple view of LSP is advocated, in which no computation is 
assumed. What might appear to be contextual phonetic implementation rules, 
or enhancement rules, are claimed to be the result of articulatory planning, 
which is part of LSP, and universal phonetics. It is suggested that laryngeal 
phenomena such as voicing retention or incomplete neutralisation are best 
viewed as LSP phenomena. 


Notes 


1. One exception is the interpretation of empty positions which are vocalised if not p-licensed (Kaye et al. 
1990). 
2. Alternatively, the spell-out is not list-like, but process-like, which also has its consequences. 
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The no-crossing constraint: 
a neglected licensing constraint 


John R. Rennison 


Background 


In Rennison (1996), I proposed an analysis of the ATR harmony of Mooré 
based on the relative ordering of the elements |A| and |ATR| on their shared or 
‘fused’ tier, working in conjunction with the No-Crossing Constraint (NCC), 
which for reference is stated in (1). 


(1) The ‘No-Crossing Constraint’ (NCC) 
Association lines do not cross. (Goldsmith 1976, 48)! 


This is the only proposal I know of that accounts for the fact that in Mooré 
only high ATR vowels (that is, /i/ and /u/) trigger harmony, but mid ATR 
vowels do not (and there is no lexical low ATR vowel). In other words: the 
stem vowels {i, u} trigger ATR harmony, but {1, e, £, a, 9, 0, U} do not. This is 
exemplified in (2) with the stem vowels /i/ and /e/. Suffix vowels are /1, v, a/ 
(that is, never mid, never ATR, never nasal). When ATR-harmonised they 
become /i, u, A/ respectively. 
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(2) ATR harmony in Mooré. Only high vowels trigger harmony. X marks 
an NCC violation. 


a. b. 
ATR, Aline > ATR A ATR A A 


skeleton— x Xx X X + X X x xX X X + X X 
I, U line > I I 
transcription > b i i + g A s e I + g a 
gloss —> ‘child’ ‘eagle’ 


This unusual restriction means that even though the word is the domain of 
ATR harmony, there are plenty of disharmonic words in the language — 
namely those with /e/ or /o/ in their stem. 

In this chapter, I will augment this extraordinary case with a new 
analysis of Turkish vowel harmony (VH) in which the elements |A| and |U] 
share a tier. In this way I will demonstrate that (a) tier fusions beyond |I/U| are 
not uncommon, and therefore (b) many LCs are in fact epiphenomena of more 
basic aspects of the theory of GP. 

How can one justify a fusion of tiers? Or, conversely, how can one justify 
the non-fusion of tiers? The theory allows (or once allowed) elements to reside 
on lines/tiers, and the precise configuration of the elements and lines remained 
a matter for empirical investigation. In Kaye et al. (KLV) (1985, 307) the fusion 
of the |I| and |U] tiers is dealt with very briefly and with little justification. It 
is assumed that no skeletal point can be associated to two different elements 
on the fused tier (for example, to both |I| and |U]). In fact, the allocation of 
elements to tiers deserved far more attention (for some early ideas on this, see 
Rennison 1990.). As a first hypothesis, let us assume that any constellation of 
tiers is possible. For the purposes of this chapter, I will restrict myself to the four 
elements |I|, |U], [A], and |ATR| within vowel systems. Note that these elements 
are by no means equal in status. In particular, the element |ATR| demands that at 
least one other element be present in the vowel. Also, for our present typological 
purposes we will largely ignore the elements |H], |L], and |R]. 

The minimum number of elements in a vowel system is probably zero, 
if Circassian [a] is a geminate schwa, as Job (1981) suggests. I am unaware of 
a language in which every logically possible combination of {I, U, A, ATR} 
is fully active/contrastive (giving a vowel system like /ce, y, Y, e, £, i, I, O, 9, U, 
U, A, a, 9, 9/), and its existence would be surprising, since additional elements 
bring additional markedness and additional phonological strength. 

In this chapter I will ignore headedness because we still do not know 
enough about it, and also because it does not seem to play any role in the 
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vowel systems and processes under consideration here. The case of Mooré 
outlined above shows clearly that |ATR| is a normal element (and subject 
to the NCC); therefore, it cannot have anything to do with headedness, and 
switching is not an option. 


Turkish vowel harmony 


In Rennison (1987) I gave my first account of Turkish VH within the 
framework of Government Phonology (GP). Since then we have learned 
more about VH systems and the research programme of GP has advanced and 
matured. Other analyses of Turkish VH have been proposed, notably Charette 
and Göksel (1994, 1996). In the latter study they claim that ‘licensing 
constraints not only determine the vocalic inventory but also explain how 
vowel harmony operates’ (Charette and Göksel 1996, 86). However, I share 
Péchtrager’s (2010) mistrust of licensing constraints in GP.? These are 
extremely problematic because they are completely unrestricted and therefore 
devoid of any explanatory power. Indeed, even the notion of headedness in 
the melodic structure of sounds (to which both licensing constraints and 
Pochtrager refer) is not well understood (though see Rennison and Neubarth 
2003). On the other hand, Charette and Göksel (1994, 1996) and Poéchtrager 
(2010) nowhere show any tiers of elements, and it is not clear whether they 
even have any — even though they make ample use of the autosegmental 
device of spreading. 


My earlier analysis 


Apart from the use of |L| (labial) as the name of what we now call the |U| 
element, the diagrams of eight words, that is, one per stem vowel, each in 
four inflectional forms, are still presentable today. In (3) I give the melodic 
content of the Turkish vowels in terms of elements, and in (4) my earlier 
representations (more neatly formatted, with |U] in place of |L}). 


(3) The melodies of Turkish vowels, without consideration of tiers or 
headedness (for the final version, see (7) below) 


vowel i ii e 6 a o u t 
elements {I TU] ĦAJ) MUA] JA] |UA] |U] | | 
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(4) 


Turkish harmonised noun forms (numbered (19) in Rennison 1987, 
351-53). The lexical shape of the genitive suffix is /in/ and of the 
plural suffix /lar/. To save space, transcriptions are shown instead of 


x’s on the skeleton. 


(a) U-tier 
A-tier 
skeleton 
I-tier 

(b) U-tier 
A-tier 
skeleton 
I-tier 

(c) U-tier 
A-tier 
skeleton 
I-tier 

(d) U-tier 
A-tier 
skeleton 


I-tier 


Nom.sg. Gen.sg. 


(empty) 

ip ip-in 

I I 

(empty) 

kiz kiz-in 

(empty) 

U u 

yüz yüz-ün 
I I 

U uU 

pul pul-un 

(empty) 


Nom.pl. 


kiz-lar 


pul-lar 


Gen.pl. 


A 


ip-ler-in 


I 


A 


kiz-lar-in 


A 


yuz-ler-in 


I 
U 
A 


pul-lar-in 
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(e) U-tier (empty) 


A-tier A A AA AA 

| | || | | 
skeleton el el-in el-ler el-ler-in 
I-tier I I I I 


(£) U-tier (empty) 
A-tier A A A A AA 
| | ltd | 


skeleton sap sap-in sap-lar sap-lar-in 


I-tier (empty) 


(g) U-tier U U U U 
A-tier A a ! A l A 
skeleton de st a A 

(h) U-tier U U U U 
A-tier i A | A A A 
skeleton da nS ae E 


I-tier (empty) 


Under this analysis, all VH in Turkish involves the left-to-right spreading of 
an element from the (last) stem vowel to the suffix vowels. The |I| element 
spreads without restriction, and the |A| element does not spread at all. The 
|U] element spreads only to vowel positions that are not associated to an |A| 
element and cannot skip over a vowel that is associated to an |A|. Thus in the 
fourth column of (4) none of the final morphemes has a |U| element because 
the penultimate morpheme (ler/lar) always contains an |A| element. 
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A tier-sharing analysis 


Tier sharing and multiple associations 


If every phonological element were located on its own independent tier, 
the question of double? associations of a skeletal point to the elements of a 
particular tier would not arise.* However, from the beginnings of Government 
Phonology it has been generally assumed that the elements |I| and |U| share a 
tier in many languages. The existence of multiple associations from a tier to a 
skeletal point raises a further issue relating to the NCC. Is it possible to have 
only single associations in the lexicon, but to produce double associations in 
the phonology? This hypothetical case is illustrated in (5). 


(5) Phonological double associations with only single lexical associations: 
four logical possibilities. NB: B and C, and the x’s to which they are 
associated are not necessarily strictly adjacent. 


a. b. C: d. 
Tier shared by the elements BandC: B C B C C BCB 

pa tei Ta Le 
Skeleton: xX X X X X X X X 


Old High German I-umlaut 


It can easily be seen that this scenario must have been responsible for the 
creation of the majority of front rounded vowels in German: an |I| element 
near the end of a word was allowed to associate to the neighbouring lexically 
specified vowel to its left. This is shown in (6). 


(6) Old High German I-umlaut. 


I/U-tier U I U I I 
Law| L1 ae) 
Skeleton: x@® x XX X X xXx X XXXXXX 
| | | 
A A A 


hor j an[herjan] turil[tyri] gas t i[gesti] 
Examples: <horjan>, New High German hören ‘to hear’; <turi>, NHG 
Tiir(e) ‘door’; <gasti>, NHG [gesta] ‘guests’. 


The resulting vowels, indicated by circles in (6), now have double associations 
to elements on the I/U-tier within the phonology, although in the lexicon there 
are only single associations (namely to |U], but not to |I|). 

Let us now investigate the double associations of Turkish vowels. 
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A new analysis of Turkish vowel harmony 


My new proposal for Turkish VH involves the notion that failures of spreading 
are caused not only by the NCC but also by the ordering of tier-sharing 
elements within a segment. In other words, if we take the NCC seriously, then 
the reason why |U| spreads to the right sometimes, but not always, can only 
be that |U| shares a tier with some element whose association lines or whose 
ordering relative to |U| block that spreading. In Turkish, the only candidate 
element is |A]. 

More formally: In Turkish the elements |U| and |A| may be associated to 
a single vowel position in the lexicon only in the order |A| — |U], but the order 
|U| — |A] is banned both in the lexicon and in derivations.>° This prohibits 
the spreading of |U| to a position already associated with |A|. The vowels of 
Turkish therefore have the representations in (7); but the two vowels in (8) 
are illicit. 


(7) The vowels of Turkish when U and A share a tier and are ordered 


A-U. 
vowel i ü e 6 a o u t 
U/A-tier U A AU A AU U 
I | Vo tit Vo 
skeleton x x x x x x x x 
| | | | 
I-tier I I I I 


(8) Two vowels that are disallowed in Turkish because they violate the 
element ordering |A| — [U| within segments. 


vowel 6 o 
U/A-tier UA UA 
skeleton x x 
I-tier I 


Now consider the representations in (9), in which the elements |U| and 
|A| share the same tier. Here only the words that have both |U| and |A] are 
considered; the other words have effectively the same representations as in 
(4) — but with a fused |U] and |A] tier. 
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(9) The relevant nouns of (4) on a tier-sharing analysis; all other nouns 
retain the representation in (4), except that the |U| and |A| tiers are 
conflated. The lexical shape of the genitive suffix remains /in/ and of 
the plural suffix /lar/. 

X = NCC violation, !X = element ordering violation 


Nom.sg. Gen.sg. Nom.pl. Gen.pl. 


(a) U/A-tier—> U U U A U A 
| PP] Pied. 
skeleton > yüz yüz+ün yüz+ler yüz+ler+in 
I-tier—> I I if L 
(b) U/A-tier > U U U A U A 
| wooo m ae 
skeleton > pul pul+un pul+lar pul+lar+in 
I-tier > 
(c) U/A-tier > AU AU AU A AU A 


Vo Ve) Vo 


skeleton > köy köy+ün köy+ler kéy+ler+in 


I-tier > I I I I 
(d) U/A-tier > AU A U A U A A U A 
VV Va) Vike, 
skeleton > son son+un son+lar son+lar+in 
I-tier > 


Each of the !X association failures would result in a segment having associa- 
tions to the U/A-tier in the illicit order |U| — |A|. However, in (9d) the first 
vowel is associated to both |U| and |A] in the order |A| — |U| and therefore 
perfectly okay. 

So finally we arrive at the two new simplified rules of Turkish vowel 
harmony: 


1. I spreads to the right.’ 
2. U spreads to the right. 


Everything else is taken care of by the element/tier configuration, and 
there is no need for segment-level licensing constraints. 
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Conclusion 


I have proposed here a unified theory in which two old friends, the no-crossing 
constraint and the autosegmental tiers, are used to define possible segment 
inventories and phonological processes (and the constraints on them). This is 
accomplished at the cost of an additional theoretical device: that of element 
ordering, whereby elements which share the same tier and are connected to the 
same skeletal point must occur in a specific linear order. A project for future 
research will therefore be to examine element ordering more closely. 


Notes 


1. This original formulation of the NCC was the second part of the Well-Formedness Condition (of which 

he first part is not relevant here). For the purposes of GP, since tiers are arranged cylindrically around 

he skeleton, we should perhaps add ‘in the same plane’. 

2. Tier sharing, element ordering, and the option of double attachments from tiers to segments determine 

he vowel inventory of a language. Thus for example Nyangumarda (Western Australia) has the 

elements |I, U, A| on a single tier and allows only single attachments from tier to segment. Therefore 

he language has precisely three vowels: /i, u, a/. 

3. I know of no case where more than two elements which share a tier can be associated to a single 

skeletal point. Languages like Nyangumarta quite definitely have three elements |I, U, A| on a single 

ier, but I have found no such language (so far) that seems to allow more than single associations 

of these elements to a single vowel. I suspect that this may be some kind of restriction on overall 

complexity. 

4. This assumes that the autosegmental tiers of elements are arranged cylindrically around the skeleton, 

hus in principle allowing the elements of each tier to associate with the skeleton without interference 
of any kind from elements on any other tier. 

5. It would be nice if the ordering of elements were derived from some higher principle. 

6. The only rule in a phonological derivation is ‘License!’. There is no rule ordering. But I am avoiding the 
word ‘license’ here in order to be perfectly clear about there being no segmental licensing constraints. 

7. Whether the direction of spreading needs to be stated is another question that I am not equipped to 
answer. 
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Prosody and constituent structure: 
a brief introduction 


Yuko Yoshida 


Introduction 


Prosodic and constituent structure in Government Phonology (GP) has 
undergone complexification since the initial syllable proposal of Kahn 
(1976) and divided into approaches where words are composed of syllables 
and moras (Hayes 1983) and constituents which link to a skeletal tier in GP 
(Kaye and Lowenstamm 1984; Kaye et al. (KLV) 1990). ‘Deforestation’ in 
phonology has been a goal since the proposal of Strict CV by Lowenstamm 
(1996), with a lateral alternative replacing the arboreal syllable structures of 
the standard theories (Scheer 2004). The questions posed are what prosodic 
constituency looks like, as well as where and how prosodic phonology 
operates. 

Elements that constitute segments (see Part 1), constituents, and prosodic 
properties pivot around the timing slots, or the timing skeleton, in Standard 
Government Phonology (SGP) (Kaye et al. 1985; 1990; Charette 1991). The 
SGP representations enabled GPists to explicate numerous phonological 
phenomena including syllabic processes with arboreal structures that indicate 
autosegmental behaviour of the constituents. Before those works, McCarthy 
(1979) proposed an intervening tier between segments and syllable structure 
in the analysis of Semitic languages differently from SGP. SGP allowed the 
use of branching rhymal constituents, though Lowenstamm (1996) later boiled 
down syllable structure to only CV constituents, corresponding to ON in SGP. 
Presence of the rhyme meant a banned operation in SGP of ‘resyllabification’: 
the word final consonant (R) is syllabified as the initial onset of the suffixal 
element in the SGP canon. 
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Constituents and government 


SGP recognises branching onsets, branching nuclei, and branching rhymes in 
a strictly binary fashion — no complex onsets of the shape *CCC are permitted, 
and ‘codas’ are defined as rhymal complements consisting ofa single segment. 
Central to the discussion of many of the chapters that follow is the concept of 
Government Licensing (GL) (Charette 1990, 1991), which provides support 
from nuclei to their preceding onset to form either a coda-onset cluster (direct 
GL) or a branching onset (indirect GL). Charette (1990, 1991) provides an 
account for the asymmetric behaviour between the consonants associated 
to the onset and ‘coda’, understood in SGP to be the rhymal complement. 
Traditionally, ‘coda’ consonants have been understood to be weak because 
they are subject to both the Constituent Government (CG) of the preceding 
nucleus, which licenses its complement, and the Transconstituent Government 
(TG) of the following onset, as exemplified in (1). 


(1) O R o ji 

N N 

x O X>XE-X xX 
vcn 


However, the analysis of clusters and codas has evolved since this period and 
different proposals for the analysis of weak segments such as codas have been 
made, especially in lateral theories such as Strict CV which do away with 
the skeletal slot as well as projection, and therefore also cannot model coda 
consonants as rhymal adjuncts in the way that standard GP does. 


Issues addressed in Part 2 


The chapters in Part 2 focus on the behaviour and representational analysis of 
different segmental combinations and their constituent structure. Pöchtrager 
(Chapter 12) revisits the issue of representing light diphthongs (tradition- 
ally represented as two unfused melodies attached to a single skeletal point), 
shedding light on the properties observed both in light and heavy diphthongs, 
evincing typological issues from English, French, and Japanese. This chapter 
crucially parallels the pattern of diphthongs to the asymmetry of (final) empty 
nuclei (for example, lampø, *laprøØ) elucidating the difference between empty 
nuclei, complex nuclei, and simplex, realised nuclei. 
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Harris and Chiu (Chapter 10) provide experimental results from Cypriot 
Greek listeners, which revisits the necessity of a one-size fits all approach to 
syllabifying consonant clusters as complex onsets. An intriguing result of a 
dichotic-listening experiment with Cypriot Greek speakers furnishes support 
for the unique behaviour of sibilants in #sC clusters, reinforcing the possibility 
that these clusters are not branching onsets. 

Multiple contributions within this section consider vowel length in 
relation to stress assignment and morphological operations. Fathi (Chapter 9) 
presents a re-examination of Cairene Arabic focusing on accent and aspects of 
the vowel length in English and Cairene Arabic. Crucially, a re-examination 
of the facts as they have been presented in the phonological literature provides 
not only a promising understanding of the vocalic phonology of Cairene 
Arabic, but also a fresh account of accent assignment in Cairene Arabic as 
linked to prominence. Fathi questions the alternating length of vowels in 
Cairene Arabic and claims that the vowels are underlyingly long. Unlike 
McCarthy (2005), the vowels do not undergo shortening, and length is one 
of the bundled properties of stress, along with pitch-induced prominence and 
an increase in duration. In another fundamental re-analysis, Lowenstamm 
(Chapter 11) presents a templatic analysis of English irregular weak verbs, 
proposing that they are similar to Arabic segholates, focusing on verbs 
involving vowel length alternations, as in keep/kept. The templatic treatment 
also accounts for the activity of voicing of the stem-final consonant and the 
past morpheme in ‘regular’ verb forms, providing a full story for the behaviour 
of vowel length, clusters, and voicing for verbs that would normally be 
considered suppletive forms. 

Two proposals consider the utility of applying a strict CVCV template 
to English vowels and stress. Quantity effects in Strict CV are often dealt with 
by Incorporation (Ulfsbjorninn 2014; Faust and Ulfsbjorninn 2018), where a 
metrically significant empty V-slot is ‘identified’ by the adjacent filled V-slot 
projected. Balogne Bérces and Ulfsbjorninn (Chapter 8) connect the Current 
Southern British English process of Prevocalic Tenseness (PT), that is, 
monophthongisation, locating the language in the typology of ‘long-vowels’ 
distribution. The analysis of PT offers the use of universal fixed parameter 
hierarchies. This is a model of cross-linguistic variation based on Charette’s 
formal typological distribution (Charette 1990, 1991, 1992). A revisitation of 
Charette’s Licensor Projection (1990) leads Yoshida (Chapter 13) to propose a 
new tier where prosodic heads meet, where only the licensor Vs, for example, 
the head member of a ‘long vowel’, are projected. Analysis on Japanese 
licensor Vs and licensed Vs involved in geminates, ‘long vowels’, ‘moraic’ /N/ 
and devoiced high-vowels, is now reflected on English stress-assignment 
focusing on underlyingly long vowels. 
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Prevocalic tenseness in English, 
binarity and the typology of long 
vowel distributions 


Katalin Balogné Bérces and Shanti Ulfsbjorninn 


Introduction 


This chapter aims to provide a better understanding of binarity, long 
vowels, diphthongs, and hiatuses in the framework of Strict CV Phonology 
(Lowenstamm 1996; Scheer 2004). It offers an explanation for Prevocalic 
Tenseness (PT) in Standard Southern British English (SSBE) by embedding 
the SSBE facts into a working typology of positional restrictions on 
long vowel distributions. Our analysis draws on insight coming from two 
directions: on the one hand, we resort to a novel mechanism for formalising 
typological variation and implicational relationships: (Phonological) 
Parameter Hierarchies (Ulfsbjorninn 2014, 2017; Benz and Ulfsbjorninn 
2018; cf. Vaxman 2018); on the other hand, we utilise a Strict CV interpreta- 
tion of quantity: Incorporation (Ulfsbjorninn 2014; Faust and Torres-Tamarit 
2017; Faust and Ulfsbjorninn 2018). This model provides an analysis where 
English vowel length (not tenseness) is contingent on English’s binary phono- 
logical quantity. This resolves a long-standing paradox created by moraic 
theory: if English allows trimoraic syllables CVVC (in words like shoulder), 
why doesn’t it allow CVVV(CV)? 

The problem stems from the fact that, a priori, Strict CV cannot 
distinguish diphthongs from hiatuses or VC sequences due to its recasting 
of constituency into flat dependency (Scheer 2013). However, the PT- 
cum-monophthongisation, that is, vocalic spreading, data (see below, section 
‘VV in current SSBE’) demands an interpretation of this contrast, since 
monophthongisation affects V+schwa sequences but not V:+schwa ones. 
Apparently only binary vocalic spreading is possible; however, as we show 
in the section ‘Long vowel typology and the parameter hierarchy’, below, 
this does not follow from the distributional restrictions on V:s. In English, a 
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vowel may spread to any V position that is (a) licensed by a filled vowel, (b) in 
absolute word-final position, or (c) before a Final Empty Nucleus (FEN) (but 
not before a Medial Empty Nucleus). Accordingly, there would be nothing 
improper about having ternary monophthongisation. 

To restrict monophthongisation in these forms, we first propose that (in 
English) Incorporation is obligatorily binary, even if ‘superheavy’ syllables 
are attested. Despite appearances, but without contradiction of any attested 
phonological behaviour, there are only two degrees of quantity in English: 
Heavy (VC, VV) > Light (V). Secondly, we propose the restriction that 
vowels can only spread into incorporated positions (cf. Arabic; Faust and 
Ulfsbjorninn 2018). Since incorporation is binary, this will make vowel 
lengthening binary, thereby cementing the state where English only has 
long tense prevocalic vowels. This interacts with an unrelated process 
whereby high vowels become tense when they spread into empty onsets 
forming glides (VCs), thus robbing English of its only other prevocalic lax 
vowels. 


Background 
VV in current SSBE 


We deal with current SSBE (referred to as General British/GB in, for example, 
Cruttenden 2014; see ‘Accent’! in Szigetvári and Lindsey 2013- for a socio- 
linguistic definition and a brief historical background, including its relation to 
classic RP). 

Wells-era RP (for example, Wells 2008) exhibited an exception- 
less static distribution affecting lexical vowel hiatuses, standardly called 
Prevocalic Tenseness (PT). 


(1) Prevocalic Tenseness: in English the first member of a hiatus, if 
stressed, is always tense. 


Accordingly, the first member of a hiatus was either a closing diphthong (as in 
crayon, hiatus”, voyage, Noam, flower) or /i:, u:/ (phonetically of diphthongal 
quality: [1j] and [uw, uw], respectively; for example, Leo, ruin) (see Balogné 
Bérces and Szentgyörgyi 2006, 43). 

However, in current SSBE, PT also applies to unstressed vowels. 
‘HAPPY-tensing’ ([1] — [i]; Wells, 1982, 257-8) is not only restricted to word/ 
stem-final unstressed position but it has extended to pretonic prevocalic (for 
example, react, radiation) and non-pretonic prevocalic (mostly, pre-schwa) 
positions (for example, glorious), as well as to unstressed /u/ (for example, 
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thank you; tuition, situation; influence) (see Wells 2008). We summarise these 
developments in (2). 


i KIT FLEECE ý 
(2) a. Pretonic tenseness zor ~ COOSF V 


b. Pre-schwa tenseness %5% — PAE / _o 


(1) and (2) combined, the result is a system where all prevocalic vowels, 
stressed and unstressed, are tense. Consequently, only tense vowels precede 
schwa. Moreover, in this variety, there are no prevocalic schwas, for example, 
meow *¥/mə.áu/, for independent reasons (see section ‘Pretonic PT’, below). 
Throughout the chapter, ungrammatical examples are shown marked by an 
asterisk (*). 

Yet another process affects VV sequences in current SSBE. The 
centring diphthongs of classic RP (that is, tautosyllabic V+ə strings) variably 
undergo monophthongisation: descriptions do not agree on the extent of this 
change.’ We identify SSBE as the variety in which all three relevant lexical 
sets have completed it (and which is, therefore, closest to the one modelled 
by Szigetvári and Lindsey 2013-—). The processes in question are illustrated 


in (3)-(5). 

(3) SQUARE esorea—[e:] e.g., stairs, parent, Hungarian 
(4) NEAR 19 —[t} e.g., Lear/leer, hero, sincere 
(5) CURE (ore) —[o:] e.g., poor, sure, tourist, security 


Note that all the original centring diphthongs contain a lax vowel as 
the first term (due to the historical process ‘Pre-schwa Laxing’; Wells 
1982) — consequently, the monophthongisation eliminates lax vowel + schwa 
sequences. Also, as remarked by all major sources including Wells (1982, 
288) and Cruttenden (2014, 84), the monophthongisation affects V+schwa 
sequences but not V:+schwa ones, which leads to the crucial difference 
between diphthongs and hiatuses in (6). 


(6) (a) [lia] — [k] Lear/leer vs. [lita] Leah 
(b) [foo] — [fo:] sure vs. [suo] sewer 


In sum, it appears that in current SSBE, PT has gained such a general scope 


that it now also applies in all VV sequences irrespective of stress relations 
between (or of the (un)reduced nature of) the vowels involved. There are also 
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reasons to assume that the monophthonging process in (3)—(5) is underway 
in order to enforce it by getting rid of the ill-formed lax vowel + schwa 
sequences, while retaining the lexical set contrast between diphthong and hiatus 
of RP. 

This is hard to understand in English, however, because we could have 
expected smoothing to completely remove vowel-schwa sequences. Instead, 
smoothing in sewer-words is not even incipient.> This is also theoretically 
puzzling since a flat Strict CV (Scheer 2013) does not make an a priori 
distinction between diphthongs and hiatuses. One might imagine that the 
impediment would be a ban on super-heavy syllables; however, prima facie 
English does allow trimoraic syllables: shoulder, mound, oinking (verb), 
and a simple ban solely on ternary long vowels would appear circular or 
stipulative. Furthermore, from the Strict CV perspective, a ban on V:: cannot 
be constructed from English-specific settings of the typology of long vowel 
distributions. 


Phonological parameter hierarchies and long vowel typology 


Based on the formal typology that is original to Charette (1990, 1992), it is 
argued in Ulfsbjorninn (2014, 2017) and Benz and Ulfsbjorninn (2018) that 
the phonological parameter settings responsible for cross-linguistic structural 
differences are organised into universal implicational hierarchies. 

We assume a CVCV skeleton, whose slots may be associated with 
melody (constituted by phonological expressions — see the segment symbols 
used as shorthand notation in our representations) either lexically or via 
spreading, or remain empty. Both spreading and emptiness are constrained by 
parametric requirements. Of immediate relevance to the present discussion are 
those parameters, referred to in the theory as ‘licensing’ (Scheer 2004), that 
are responsible for the well-formedness of long vowels. 

There is a well-established cross-linguistic observation that if a language 
allows V:s then it universally allows V:s preceding a filled V (as in hypo- 
thetical [ba:ra]; see (7)) — we therefore conclude that the most fundamental 
licensor type for V spreading is a filled V that immediately follows the long 
vowel (Yoshida 1993; Kaye 1995; Scheer 2004). 


(7) The site of a V:’s spreading is licensed by the following V 
Lic 
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However, we note that cross-linguistically long vowels are also found in other 
environments, which we will assume require further parametric specification 
on Licensing. These include a V: before a Medial Empty Nucleus (MEN) 
producing forms like [ba:m@pi] (8a) and a V: before a Final Empty Nucleus 
(FEN), leading to forms like [ba:m@] (8b); and whether a FEN can be the 
target of spreading as in [bama:] (9). 


(8) V: licensing by EN 
a. Medial Empty Nucleus (MEN) 
cC V EEAS vc V 


D | I 


b a m p i 


b. Final Empty Nucleus (FEN) 
C V E V 


h tee j 


b a m 


(9) FEN is part of the V: 
C V C V C Veen 
n a te 


We will show that these environments are actually implicationally related to 
each other. To capture this formally, we propose that there is a fixed hierarchy 
among these V: licensing parameters. 


Long vowel typology and the parameter hierarchy 


The chart in (10) below summarises the positional restrictions on long vowels 
that we identify in the language types specified in the first column. The other 
three columns provide the environments (that is, licensing by MEN (8a), by 
FEN (8b), and FENinV: (9), respectively) in which language data are either 
attested (indicated with ‘W’) or not (‘X’). 
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(10) Positional restrictions on V: 


Env V:C.CV/# V:C# V:# 
Lang ba:mpi/O ba:m bama: 
Licensor type MEN FEN FENinV: 
Type 1 
Chugach, Italian x x x 
Type 2 
Turkish x x v° 
Hausa x x v 
Type 3 
Icelandic x’ sy Jv 
Type 4 
Cairene Arabic x y x 
Type 5 
Palestinian Arabic s5 y x 
Type 6 
Hungarian’, Pulaar J J J 


On the basis of this typology, we establish a parameter hierarchy diagrammed 
in (11), where the stacked parameters are shown together with an example 
language.!° Height in the hierarchy is graphically represented: higher- 
stacked parameters are positioned above lower-stacked ones to symbolise the 
dominance relation, from which the implications follow (lower implies higher). 
The highest parameter is Filled (see 7), located at the top of the hierarchy. 
Whenever the setting is ‘yes’ for a given parameter, the subordinate parameter 
begs to be set. Note that Empty implies Filled, and Medial implies Final.'! 


(11) Parameter hierarchy for long vowels (English settings underlined) 


Filled no (no V:) Spanish 
no —— FENinV-: ------- yes------ Final Empty — no (ba:ma only) Italian 
yes VES ------=--- Medial Empty — no 
(maba:) (maba:m) 
Hungarian yes Cairene 
(ba:mpa) 
Hungarian 
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An additional function of the diagram in (11) is to indicate V: settings for 
English (in double underline). Given these parameter settings, a vowel in 
English may spread to any V position that is (a) licensed by a filled vowel; 
(b) in absolute word-final position; or (c) before a FEN but not before a MEN. 
A significant consequence of this for spreading is that, theoretically, (b) and 
(c) may combine in the way shown in (12): a FEN is capable of licensing 
spreading from V, to V, and receive docking melody at the same time. This 
creates the (false) grammaticality of ternary monophthongs. 


(12) Ternary monophthong ([lr::]) 
Lic 


One way out of this problem is to simply stipulate that vocalic spreading is 
binary in English. However, an alternative solution that uses an independently 
justified (though apparently unrelated) mechanism is possible. 


Why is long vowel spreading binary? Quantity 


Quantity and stress in Strict CV 


In Strict CV, stress is regulated by parameters on the projection of V-slots, 
filled or empty (Larsen 1998; Yoshida 1999; Harris and Gussmann 2002; 
Scheer and Szigetvari 2005; Charette 2008; Ulfsbjorninn 2014; Faust and 
Ulfsbjorninn 2018). 

In English, all light nouns tend to have antepenultimate stress (for 
example, c[a]/ibre ‘calibre’), and word-final singleton consonants (onsets) are 
‘extrametrical’ (Charette 1984; Burzio 1994; Harris 1994; Balogné Bérces and 
Szentgyörgyi 2006), for example, p[é]/ica<n> ‘pelican’. But the language is 
quantity-sensitive, therefore heavy penultimate syllables tend to attract stress: 
mar{i:]na ‘marina’, cor[su]na ‘corona’, pol[én]ta ‘polenta’. Final long vowels 
are always stressed, for example, jambor[i:] ‘jamboree’. 

Quantity effects in Strict CV are handled by the mechanism of 
Incorporation (Ulfsbjorninn 2014; Faust and Ulfsbjorninn 2018), where a 
metrically significant empty V-slot is ‘identified’ by the heightened projection 
of an adjacent filled V-slot. The V-slots of the incorporation environment/ 
domain are shown grey-shaded beneath. 
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(13) Incorporation in Strict CV 
a. Project a V to L3 if it precedes an empty V 


b.  polénta 
3 *) 
2 * x * 
1 x x Ti * 
0 


CVCWVCVC v 
hod ol el || 
po len t ə 


English does have (C)VVC.CV syllables; however, there is no phono- 
logical behaviour that identifies these as super-heavy. Although segmentally 
‘syllables’ may be large, in terms of quantity, there is only heavy and light. 
Pulaar and Kashmiri, in contrast, do show that CVVCs are heavier than heavy 
syllables (CVV > CVC) since the former outcompete the latter in stress 
assignment. This shows that, formally, even empty V-slots that are not strictly 
local to a filled V can be incorporated. However, English incorporation is 
strictly local, resultantly quantity is strictly binary. 


(14) Incorporation (binary) 
Incorporation is strictly local. 


(15) CVVC and incorporation: [ foulds] shoulder 
3 aa ie 


2 
1 * *) * * 
0 


Given this metrical condition on quantity, we propose the following condition 
on vowel length in English (identical to that of Arabic, see Faust and 
Ulfsbjorninn 2018). 


(16) Long vowel spreading only targets incorporated positions. 


Effectively, vowel length is piggy-backing on quantity. This explains why 
smoothing targets ‘diphthong’ schwa (17a) but not hiatus schwa (17b). 
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(17) a. [lr] Lear/leer b. [lia] Leah 


"i i 
* * 
* * * * * 
a a 
c lz c WME C V, 
| le D= a | 
1 1 ə l i ə 


Pretonic PT 


To complete the picture, we now turn to the cases of what we call meow-words 
(introduced in (2a) above) (18a). Here we observe that in SSBE these lax 
vowels have tensed and glided into the onset of the oncoming hiatus (18b). 
For some speakers, this is further accompanied by (near) loss of the vocalic 
position (‘High vowel gliding’ — 18c). 


(18) Pretonic PT 
a. *mı.áu / *me.áu / *mə.áu 
b. [mïáw] meow 
[te"ifon] tuition 
c. [mjaw] meow 
[twifon] tuition 


For reasons that are not clear, it seems that gliding is always accompanied 
by the tensing of the vowel. This may be perhaps related to the fact that 
effectively no language contrasts tense/lax consonantal glides (though as 
consonantal high vowel analogues such a contrast is conceivable). 


(19) Pretonic tenseness: [milaw] ‘meow’ 


3 r) 

2 * x 

1 * * a 
0 


C V C VM C V 
A 
m 1 a u 


Whatever the explanation, this tensing removes the only other source of 
pre-vocalic lax vowels in the phonological system (without entering into a 
discussion of triphthongs, which would take us too far afield). 
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Conclusion 


PT is nearing completion as a process in SSBE. Some of this has been achieved 


through smoothing of pre-schwa diphthongs. However, no such smoothing 


is even incipient in hiatus+schwa sequences. Lax vowels are eliminated 
before schwa but the language forbids the creation of CVVV sequences thus 
preserving tense vowel+schwa hiatus sequences. Blocking CVVV sequences 


does not follow from typological or even language-specific bans on this 


syllable type. We claim, however, that vowel length in English is inextricably 
tied to quantity, and quantity in English is binary. Although English has CVVC 
syllables, these are just ordinarily heavy and there are no CVVVs. Moreover, 


Pretonic PT comes as a product of gliding into a stressed syllable. 


(20) Contexts Process 


Pretonic lax (e.g., tuition, create) >  Tensing 
Pre-schwa lax (e.g., bear, beer, sure) > Deletion of schwa 
Pre-schwa tense (e.g., Leah) 


Distribution Lax and tense before vowels No lax before vowels 


Notes 


i; 
2. 


$2 S09 Sy 


http://cube.elte.hu/accent.html. 

It is irrelevant to the present discussion in what sense the first vowel of hiatus is stressed — it is an 
unreduced vowel. 

Of the three lexical sets, Wells (1982, vol. 1, especially 3.2.7) only discusses [9:] as an RP pronunciation 
variant of CURE (referring to the change by various names incl. CURE-FORCE/THOUGHT Merger, 
Second FORCE Merger, CURE Lowering), the other two are centring diphthongs in his analysis. 
More recently and in contrast, Cruttenden (2014) considers the SQUARE vowel to have completed 
the monophthongisation, and already transcribes it as /e:/, noting that ‘[o]lder speakers of GB may 
have a diphthong [eo]’ (Cruttenden 2014, 118); the other two are treated in his system as centring 
diphthongs (2014, 84-5), and the monophthongal pronunciations are only mentioned as marginal 
variants (2014, 154, 156). CURE monophthonging is considered highly variable in Cruttenden (2014, 
156), and the variation in lexical incidence observed with CURE is also mentioned by Wells (1982, 
vol. 2, 4.1.5). In Szigetvari and Lindsey (2013-), all three are long monophthongs, transcribed as 
shown in (3)-(5), with the note in the ‘Accent’ section that in CURE, ‘[w]Jhen preceded by a palatal 
consonant, many speakers have [o:], for example, Europe, security, during, mature’. 

[jə] when heavily accentuated. Classified as a ‘broad London’ pronunciation in Cruttenden 
(2014, 154). 

In some cases, such as theatre [i:ote or Oieto or Orta] there is variation in the lexical set of the 
item; however, smoothing only occurs in cases where this token has changed into the NEAR-set 
(diphthong, never directly from a hiatus source) *th[i:(:)]tre. In a later change, some speakers with 
NEAR smoothing have begun to lax the Leah set. 

Only in monosyllables. 

However, the language does allow mysterious light rising diphthongs (Árnason 2011). 

These are only permitted in derived words, however. 

Length in this environment can be extremely restricted. However, we are only looking at positional 


environments, not melodic ones. The presence of even one V: in this syllable structure configuration 
would be enough to set the parameter to <yes>. 
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10. These are used in syntax (Baker 2001; Biberauer et al. 2013; Sheehan 2014; pace Boeckx and 
Leivada 2013). 

11. Interestingly, the implicational statement ‘Empty implies Filled’ opens the door for third factor 
explanations. First, Emptiness is more marked than Filledness considering that phonology is a 
module devoted to externalisation (Chomsky et al. 2002; Chomsky 2005) and therefore phonological 
objects that will receive no phonetic interpretation need extra licensing and extra stipulations in the 
grammar (not less). Cyran (2003) discusses complexity scales and markedness in a closely related 
way. However, Medial implies Final does not seem to follow so easily from general principles, which 
begs the question: how did it become this way? Crucially these questions can only be asked with 
such clarity due to the formalism employed in the analysis of the typology (specifically Strict CV 
principles). In an instance of McCarthy’s famous edict: ‘if the representations are right, the rules will 
follow’ (1988, 84). 
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Vowel length and prominence 
in Cairene Arabic 


Radwa Fathi 


Introduction 


Cairene Arabic implements the characteristic alternating pattern documented 
in (1). The usual view (implicit in the representations below) is that the final 
vowels in the words at the left in (1) are short whereas the same vowels are 
long in the various environments on the right. 


(1) 
i. ¢a8a ‘dinner’ vs. GaSa:k (CaSa+k) ‘your (masc.sg.) dinner’ 
ii. kalti ‘you (fem.sg.) ate’ vs. makaltí:š (ma+kaltit+ š) ‘you (fem.sg.) didn’t eat’ 


iii. rámu ‘they threw’ vs. ramú:ha (ramu+ha) ‘they threw it (fem.)’ 


Some process is evidently at work in (1). Repeated application of that process 
is illustrated in (2). 


(2) 

i. katabu ‘they wrote’ (katab-u) 

ii. katabu:ha ‘they wrote it (fem.)’ (katab-u+ha) 

iii. katabuha:li ‘they wrote it (fem.) to me’ (katab-u+ha+1+i) 


iv. makatabuhali:’ ‘they did not write it (fem.) tome’ (ma+katab-u+ha+l+i+8) 


One question raised by the data in (1) and (2) is the directionality of the process: 
are the alternating vowels underlyingly short and lengthened in a specific 
context? Or are they underlyingly long and shortened under definable conditions? 

Watson (2002) argues that vowels behaving as shown above are under- 
lyingly short and lengthen (cyclically) before a suffix on account of a rule of 
Pre-Suffix Vowel Lengthening, for example, /katabut+ha/ — [katabú:ha], / 
katab-ut+ha+li/ — [katabuhd:li], etc. Pre-Suffix Vowel Lengthening in turn 
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feeds arule of Unstressed Vowel Shortening which forces all vowels lengthened 
on a previous cycle to resume their original brevity; thus [katabwha:li], not 
*[katabu:ha:li], [makatabuhali:8], not *[makatabuw:ha:li:3], etc.! 

McCarthy (2005) rejects Pre-suffix Vowel Lengthening, arguing that 
the final vowels targeted by the process illustrated in (1) and (2) are underly- 
ingly long, then shortened in the appropriate environment. 

I accept McCarthy’s view that the alternating vowels are underlyingly 
long, though I will argue that they do not undergo shortening. My claim, in 
other words, is that the alternations in (1) and (2) are not length alternations 
at all. In the next section, I review McCarthy’s arguments. In the next two 
sections, I re-evaluate the evidence, and develop the idea that ‘length’ has 
very little to do with the alternations in (1) and (2). Pitch-induced prominence, 
I claim, is key to their understanding. In the section that follows this, I discuss 
an empirical consequence of my proposal.” 


McCarthy’s proposal 


McCarthy’s argumentation proceeds in two steps. In a first step, he shows 
based on theory internal considerations how the underlying length of pre- 
suffixal vowels follows as a theorem from OT. The second step consists 
of matching this result with evidence. I briefly review two of the empirical 
arguments adduced by McCarthy in the course of that second step. 


Argument 1 


Syncope deletes unstressed high vowels in environment VC_CV, as in 
for example, húwwa m_darris ‘he (is a) teacher’ (< /húwwa + mudarris/) 
or Sir_b il?áhwa ‘he drank the coffee’ (< /Sirib + il-?Pahwa/). Word-final 
unstressed high vowels, however, never delete in that very same context; thus, 
katabu gawa:b ‘they wrote (a) letter’, and never *katab_gawa:b and tiktibi 
gawá:b ‘you (fem.sg.) write (a) letter’, and never tiktib_ gawa:b. According to 
McCarthy, the fact that the vowels of the last two examples resist syncope is a 
direct reflection of their underlying length (McCarthy, 2005, 18). 


Argument 2 
Here, I quote from McCarthy (2005, 20) verbatim: 


(3) A [third] point in support of an analysis with underlying stem-final 
long vowels is the existence in Cairene of a derived contrast between 
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tense and lax short high vowels (Mitchell 1956: 10-11, 112). Long 

i: and u: are pronounced as the tense vowels [i:] and [u:], while their 
short counterparts i and u are pronounced as the lax vowels [1] and 
[v]. But when a surface short vowel is derived from an underlying 
long vowel, it is pronounced as tense.‘ For instance, Sili:h ‘take (fem.) 
it (masc.) away!’ is pronounced as [Sili:h] and not *[Sili:h]. That is 
because the vowel of the first syllable is underlyingly long and has 
been shortened in an unstressed syllable, as shown by forms like 81:1 
‘take away (masc.)’. The same goes for [sibha] ‘leave it (fem.)!’, 
where /i:/ has been shortened in a closed syllable. 

According to Mitchell, final short i and u are also pronounced as 
tense: [Si:li] and not [Si:I1] for ‘take away (fem.)’ [...], this observation 
suggests that final vowels are tense because they are underlyingly 
long. 


McCarthy’s arguments are quite insightful. The fact that stem final high 
vowels defy syncope and retain their height and tenseness is fully consistent 
with the conjecture that they are underlyingly long and fully inconsistent with 
the alternative. But, as McCarthy notes, his two arguments involve a paradox: 
those vowels behave just like long vowels, as he points out, but they are not 
long. I intend to reappraise McCarthy’s interpretation, but before I do this, a 
point of fact demands attention. 

McCarthy’s second argument above mentions two statements from 
Mitchell (1956): 


(4) The qualities of ii and i in Si:li ‘take away (fem.)!’, Sili:h ‘take 
(fem.) it (masc.) away!’, siib ‘leave!’ and sibha ‘leave it (fem.)!’ 
are substantially the same, and the 7 of, for example, sibha is not 
pronounced as i in, for example, bint ‘girl’ (Mitchell 1956, 112). 


ii is regularly shortened in certain contexts, for example, Silhum 
‘remove them!’, but the sound of the vowel remains as for ii 
(Mitchell 1956, 10-11). 


Mitchell is quite right about ii and i being the same in ší:li, in šilí:h, and in siib. 
On the other hand, he is entirely incorrect about the quality of the first vowel 
in silhum or sibha: that vowel is definitely not the same as the first vowel 
in Silizh or the last vowel in Si-/i; rather, it is the same as the vowel in [bent] 
(or [brnt]).° Indeed, a rule exceptionlessly shortens and lowers long high 
vowels in non-final closed syllables: /8i:1+hum/ — [Selhum] (or [S1lhum]), 
/si:b+ha/ — [sébha] (or [srbha]). 
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That correction will prove important in the next section. We can now 
return to the evidence considered by McCarthy and re-evaluate it. 


Re-evaluating the evidence 


How can we detect brevity? The data discussed so far provides two 
unambiguous indications of how true brevity is reflected in the pronuncia- 
tion of underlying high vowels. First, if the vowel is underlyingly short, it 
redundantly lowers to the mid-range as in, for example, /zimi:l/ — [zemi:1] 
(or [zimi:l]) ‘colleague’, /zibu:n/ — [zebu:n] (or [zibt:n]) ‘client’, etc. 
Second, when the vowel is underlyingly long and ends up in a non-final 
closed syllable, it shortens. This is one of the most solidly established 
generalisations in the phonology of the Semitic languages. Shortening 
redundantly entails lowering to the mid-range, as could be seen with the 
example of, for example, [sébha] (< /si:b+ha/). In neither case does height 
survive. Evidently, a Cairene Arabic vowel cannot maintain its underlying 
height whether it was short to begin with or if it underwent shortening. 
I submit that (5) holds. 


(5) Height is licensed by length 


In light of this, what can we make of the first vowel of, for example, sibi: (h) 
‘leave (fem.sg.) him!’ or the last vowel of, for example, s7:bi ‘leave (fem. 
sg.)!’ which have retained their height and their tenseness even though they 
supposedly lost all length? One approach is to simply label that phenomenon 
‘opaque’, a mere observation which leaves the problem intact. An alternative 
is to question whether those vowels really lost their length and consider the 
possibility that they behave as long vowels because they are long vowels. 
I will explore the hypothesis in (6). 


(6) a. Height is licensed by length (5). 
b. The vowels underscored in sibt:(h) and si:bi have retained their 
height. 
c. Therefore, they are long. 


In the rest of this chapter, I explore the consequences of a strong version of 
(6c): 


(7) The grammar of Cairene Arabic includes no rule of unstressed vowel 
shortening. 
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At first sight, and in view of the facts of (1) and (2), arguing for (7) looks 
like an uphill battle. However, in order to evaluate the plausibility of (7), we 
have to compare the weight of the various pieces of evidence available to us. 
With /zimi:1/ — [zemi:1], we have incontrovertible indication that its first 
vowel is short. Not only does it lower, it syncopates at the first opportunity: 
ya z_mi:li ‘O my colleague!’ (< /ya + zimi:l+i/). With /si:b+ha/ — [sébha], 
we are dealing with one of the sturdiest phenomena of Semitic phonology, 
the shortening of a long vowel in a non-final closed syllable. This is all hard 
phonological evidence. 

When it comes to the putative brevity of the first vowel in sibi:(h) and 
the last vowel in sí:bi, phonological evidence clearly militates against their 
brevity. Not only do they retain their height and tenseness, but they also cannot 
syncopate as short vowels do; hence konti sibi:h ‘you (fem.sg.) should have let 
him’ and not *kónti s_bich (< /kúnti + sibi:h/). The only reason to insist never- 
theless that those vowels are ‘short’ lies outside the realm of phonological 
behaviour. It is the auditory percept that they ‘sound’ short, for instance when 
compared to the stressed vowels (underscored) in the same words, sibi.(h) and 
si:bi. But to draw any conclusion from that auditory difference we would have 
to be sure that what we hear in sibi:(h) and si:bi is pure ‘vowel length’. This is 
precisely what I intend to question. 

In the next section, I claim that the difference is one of prominence, not 
one of length. 


Prominence, not length 


It has long been recognised that Cairene Arabic stress assigns dramatic saliency 
to one single syllable per phonological word. The correlates of saliency are 
melodic height,° loudness, and duration (Mitchell, 1975, 94). In a meticulous 
and detailed study, Hellmuth (2006) analyses the system as a stress-accent 
system with postlexical intonational pitch accents. She construes pitch-accent 
induced salience in terms of a Low High contour tone (henceforth LH) 
associated to the metrical foot and eventually marking the stressed syllable. 

Hellmuth’s instrumental findings show how the LH melody differ- 
entially associates to syllables depending on their geometry, CVC, CV, or 
CVV. Her examples are mdnga ‘mango’, mdlek ‘king’, and má:leħ ‘salty’ 
(noted malik and ma:lih by Hellmuth). In all three cases, the Low tone aligns 
with the onset (or very shortly thereafter). On CVC syllables, the High tone 
aligns shortly after the coda (mdnga). In CV syllables, H aligns shortly after 
the middle consonant (malik). On CVV syllables, by contrast, H aligns a bit 
before the middle consonant (ma:lih). 
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Abstracting away from fine-grained phonetic distinctions, a representa- 
tion of a type familiar from the description of tone languages can be inferred 
from Hellmuth’s results:” 


All stressed syllables in (8) are salient as against unstressed syllables. 
However, the configuration in (8c) stands out against the other two: the high 
tone is entirely absorbed by vocalic material. For easier reference, I call such 
a syllable prominent as against all other syllables. This is where pitch will 
raise highest, where loudness will be maximal, and duration most increased. 
What we hear in stressed long vowels, in other words, is not their phono- 
logical length but the total bundle of properties I just mentioned, and which 
culminate in prominence. If this is correct, a comparison between the two 
vowels of, for example, s7:bi can only tell us which is prominent and which is 
not; it does not tell us that the non-prominent one is short. Indeed, I claim that 
the two vowels of si:bi or sibi:h differ in prominence but are equally long. For 
graphic convenience, I note prominence Y, where F stands for the configura- 
tion exemplified in (8c) and reproduced as (9a). All vowels associated with 
that configuration will be marked +2, all others (unstressed long and stressed 
or unstressed short) —Y. This is shown in (9) with some of the examples 
discussed earlier. Stressed syllables are noted in bold italics. In (9f, g), I show 
how the vowels of si:bi or sibi:h differ in prominence but not in length. 


(9) a. b. c. d. e. f: g. 
L H 
| A -P -P -P-P +P-P -P-P +P-P +P-P -P +P 
CVV manga malek maaleh katabuuhaa siibii siibii(h) 


On this view, +P vowels stand out against all others. My system thus implements 
the same dichotomy as in classical accounts by Watson or McCarthy but on the 
basis of prominence, not length. All length distinctions are preserved. In a ‘short 
and opaque’ account, it is an accident that such opaque vowels are confined to 
open syllables. In my proposal, there is nothing opaque about the ‘short’ high 
vowels which behave as if they were long. They are long (whether they be +P 
as in sibi or —F as in st:bi) and they perfectly fit the classical Semitic pattern 
whereby long vowels exclusively occur in open syllables. 
In the next section, I examine a prediction of my proposal. 
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Duals 


A noun in Cairene can be singular, dual, or plural. The dual is formed by the 
suffixation of +e:n, for example, kalb ‘dog’, kalbé:n ‘two dogs’. The dual 
suffix +é:n always carries main stress. According to the view put forward in 
the previous section, the prominence of +é:n will systematically result in the 
non-prominence of all other vowels of the noun. This can be seen in (10) where 
di:n is +2 in (10a.i), but demoted to -Z when appended by +é:n (10b.i).8 My 
proposal being that A-marking exclusively encodes prominence and leaves all 
other properties of the vowels intact, I predict that vowels marked —F in (10b) 
will remain unchanged under dual suffixation. That is, the duals of all forms 
will have the exact same stem vowel as their corresponding singulars. 


(10) 
a. Singular b. Dual 
+P -P +P 
i. /di:n) [díi:n] ‘religion’ /dinte:n/ — /di:ne:n/ diné:n] ‘two religions’ 
i. /nu:r/ [nar] ight’ /nu:r+e:n/ — /nuire:n/ nuré:in] ‘two lights’ 
iii. /ba:b/ [bá:b] ‘door’ /ba:b+e:n/ — /ba:be:n/ [babé:n] ‘two doors’ 
iv. /be:t/ [bé:t] ‘house’ /be:t+e:n/) — /be:te:n/ beté:n] ‘two houses’ 
v. /den/ [dé] ‘debt’ /de:n+e:n/ — /de:ne:n/ [dené:n] ‘two debts’ 
vi. /yo:m/ [yó:m] ‘day’ /yom+en/ — /yo:me:n/ [yomé:n] ‘two days’ 
vii. /so:t/  [şó:t] ‘voice, /şo:t+e:n/ — /şo:te:n/ şoté:n] ‘two voices, 
vote’ votes’ 


Moreover, consider a minimal pair such as di:n/de:n ‘religion/debt’. Because 
the quantity (and therefore the quality) of stem vowels is unaffected by dual 
formation, the contrast will be reproduced in the dual forms, hence [diné:n] 
‘two religions’ versus [dené:n] ‘two debts’.®'° Both predictions are in 
accordance with the facts. 

My account contradicts Broselow’s (1976, 18ff.). Broselow asserts 
that unstressed long mid vowels shorten and rise to high. Thus, she reports 
yumeé:n and bité:n for the duals of yo:m and be:t, respectively. As well, she 
predicts that /di:n+e:n/ and /de:n+e:n/ will neutralise into [diné:n]. In both 
cases, this is at odds with my experience with the language, but more signifi- 
cantly with the information gathered in countless interviews with other native 
speakers. The confusion, I believe, can be traced back to Broselow’s reliance 
(Broselow 1976, xix) on material from Lehn and Abboud’s Beginning Cairo 
Arabic (Lehn and Abboud 1965). Lehn and Abboud consistently ignore the 
distinction emphasised by McCarthy in (3) above and in Note 4. Because 
their crude orthographic system makes no room for short mid vowels, the 
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only graphic symbols available to Lehn and Abboud are i and u, respectively. 
Evidently, [yumé:n] or [bité:n] with initial high vowels are mere artefacts of 
Lehn and Abboud’s inadequate notational system. !! 


Conclusion 


A number of notions about vowel length and brevity in Cairene Arabic have been 
taken for granted for decades. Here, I have explicitly challenged one such notion, 
viz. ‘length is what we hear in, for example, katabu-:ha’. I have argued on the 
contrary that what we hear in that case is Prominence, not length. My proposal 
also carries an implicit challenge of most accounts of the stress assignment 
system of Cairene. Indeed, in most accounts katabu-ha has penultimate stress 
because of its outstanding length. In my account, the penultimate vowel of 
katabu:ha stands out (as prominent) because it carries stress. !? 


Notes 


1. Note the asymmetrical role of stress in Watson’s account: stress plays no role in the lengthening of 
vowels, but its absence plays a crucial role in their shortening. 

2. The claims I make in this chapter are intended to cover non-low vowels, both front and back. But 
because of space limitations, the discussion has been confined to front vowels. 

3. For the sake of accuracy, I follow Harrell (and many others) in representing as [e] the vowel noted [1] 
by McCarthy. Harrell (1957, 53) describes it as (in): « Free variation from somewhat below [1] to a 
lax [e] ». Because this issue is peripheral to the main point of this chapter, I often quote examples in 
both versions, for example, «...[bent] (or [brnt])...». 

4. Recognising this difference goes a long way towards correcting an unfortunate practice which goes 
back to the elementary manuals of Lehn and Abboud (1965) and Abdel-Massih (1974), and still mars 
virtually all generative work on Cairene: these authors ignore (Lehn and Abboud) or deny (Abdel- 
Massih) the distinction mentioned by McCarthy. As a result, they are incapable of representing some 
of the most basic surface contrasts, for example, the difference between the surface vocalic melodies 
in, for example, [Sili:h] and [zemi:1] (or [zımí:1]) ‘colleague’ both identically noted by them as 
CiCi:C; or in [kalbe bonni] (or [kalb1 bonni]) ‘a brown dog’ versus [kalbi bonni] ‘my dog is brown’, 
an example adduced by Hafez (1996, 8). Note how Hafez renders the epenthetic vowel in [kalbe 
bonni] as [e]. So does Harrell (1957, 60) in sette kwayyesa ‘a good lady’ versus setti kwayyesa ‘my 
grandmother is good’. On this point, see also Mitchell (1993, 128) describing /i/: “the norm is a front 
spread vowel, half close in, for example, Egyptian Arabic fikr ‘idea, thought’, fihim ‘he understood”. 
In (Mitchell 1990, 17), his example of a front half close vowel is French fée [fe] ‘fairy’. 

5. The same error is repeated in Mitchell (1962, 24), though not in Mitchell (1990; 1993). 

6. On melodic height, Harrell (1957, 17—18) reports possible differences of up to one octave between 
the melodic height of a stressed and an unstressed vowel. 

7. While this section owes much to Hellmuth (2006), the interpretation of her work is my own. 

8. The careful reader will have noted that, unlike all other + syllables, the syllable of the dual marker 
seems to involve a coda. In reality, word-final CVVC syllables function as penultimate open syllables 
CVV-C(v)# in Egyptian Arabic as in many other Semitic languages. 

9. As is the case with all subtle distinctions, the [diné:n]/[dené:n] contrast will be lost in very fast 
speech. 

10. Conversely, /di:n+na/ ‘our religion’ and /de:n+na/ ‘our debt’ do neutralise, but into [denna] (or 
[dinna]), not *[dinna]. Harrell (1957, 55) does note /dinna/ with i but it must be kept in mind that 
Harrell’s examples are quoted in their ‘phonemic’ form (Harrell 1957, 6), not as they are realised on 
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the surface. His description of the phonetic form of underlying (or, in his terms, ‘phonemic’) /i/ in 
closed syllables was reproduced in Note 3. 

11. Lehn and Abboud, and Abdel-Massih are actually isolated in their unfortunate practice of noting 
yumeé:n and bité:n. Indeed, most authors note yomé:n and beté:n with no modification of the original 
quality of the initial mid vowel for example, Spitta-Bey (1880, 132), Willmore (1905, 238, 322), 
Jomier and Khouzam (1964, 37), Mitchell (1956) on a dozen occasions, Jomier (1976, 111), Hinds 
and Badawi (1986, 965). 

12. On this, see Fathi (in preparation). 
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#sC in stereo: a dichotic-listening 
study of initial clusters in Cypriot 
Greek 


John Harris and Faith Chiu 


Introduction 


It was once widely held as a matter of self-evident truth that any consonant 
cluster at the beginning of a word forms a syllable onset.! At least for 
languages that impose phonotactic constraints on initial clusters, such as 
English, there is little reason to doubt that this assumption is correct for 
sequences of rising sonority (such as in play, grow, flow). However, the 
assumption has always sat awkwardly with sequences consisting of a sibilant 
plus consonant (‘#sC’), as in spin, sting, skin. One long-acknowledged 
reason has to do with phonological distribution: in the relevant languages, 
#sC is typically alone in defying virtually all the phonotactic restrictions 
that otherwise apply to initial clusters, such as the requirement that complex 
syllable onsets rise in sonority. 

There is mounting evidence from various other sources that confirms 
the syllabically anomalous status of #sC. This evidence, which we summarise 
below, includes the behaviour of #sC in morphophonemic alternations, first 
language acquisition, and speech production. In this chapter, we turn our 
attention to evidence from speech perception. 

Is the anomalous nature of #sC due to some property specific to 
sibilants or to some more general property of clusters that fall in sonority? 
We investigate this question by means of a dichotic-listening experiment. 
Previous studies using this methodology show that English listeners, when 
simultaneously presented with words containing different initial singleton 
consonants to each ear, hear certain pairs as fusing into single-word percepts, 
but not others. Pairs consisting of an obstruent (for example, pay) and a liquid 
(for example, /ay) are readily fused and sequenced in the order obstruent- 
liquid (play), that is, as a complex rising onset (Cutting and Day 1975). 
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However, pairs consisting of [s] (sigh) and another consonant (pie) are much 
less likely to be fused (Chiu et al. 2016). 

To understand whether sibilance uniquely accounts for the non-fusing 
behaviour of #sC, we need to investigate languages which, like English, 
impose phonotactic restrictions on word-initial clusters but which, unlike 
English, also allow non-rising clusters to begin with nonsibilants. Cypriot 
Greek fits the bill. It has a range of word-initial falling-sonority clusters that 
includes not only #sC but also sequences starting with nonsibilant fricatives 
(#fC), such as [fc] and [xt]. Morphophonemic evidence, which we detail 
below, indicates that #sC and #fC in Cypriot Greek pattern together and are 
not syllabified in the same way as rising-sonority clusters. 

Native Cypriot Greek listeners were dichotically presented with pairs of 
nonwords containing different initial singleton consonants in varying sonority 
combinations and then asked to report what they heard. In line with previous 
studies of English, obstruent-liquid pairs were found to favour fusion and, 
when fused, were more likely to be sequenced as rising clusters. In contrast, 
neither of these biases was observed in fricative-stop pairs. Tellingly, the latter 
result was found for both sibilant and nonsibilant fricatives. This co-patterning 
is consistent with the shared morphophonemic behaviour of #sC and #fC and 
suggests that neither is syllabified like rising onset clusters. It also suggests 
that sibilance is not special and that there is some more general constraint 
disfavouring the syllabification of falling clusters as complex onsets. 

In the next section we review the evidence against syllabifying #sC 
as a complex onset and consider what properties might make it syllabically 
anomalous. In the third section we describe the morphophonemic behaviour 
of initial clusters in Cypriot Greek. The fourth section summarises the 
methodology and general findings of previous dichotic listening experiments. 
In the fifth section we present our dichotic listening study of Cypriot Greek. 
The final section lays out our main conclusions. 


#sC 
Syllabifying #sC 


In what follows, we will use the conventional notion of sonority to describe the 
various phonotactic configurations found in word-initial consonant clusters. 
This is purely a matter of descriptive convenience and is not meant to imply an 
assumption that sonority corresponds to some independent phonetic property 
(for differing views on this matter, see Parker 2012). 

Our interest in #sC will be focused entirely on clusters of falling 
sonority, that is, clusters consisting of sibilant plus stop (for example, 
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[sp, st, sk]). We will have nothing to say here about rising-sonority clusters 
consisting of sibilant plus sonorant, such as are found in English (for example, 
slide, snide, smile). There are clear phonological differences between these 
two types of cluster. For one thing, some languages permit initial sibilants to 
be followed by a stop but not by a sonorant. 

Besides the anomalous phonological-distributional characteristics 
mentioned above, there are three main sources of evidence against the 
traditional view that #sC syllabifies as a complex onset, just like initial rising- 
sonority clusters. 

The first piece of evidence comes from syllabically sensitive morpho- 
phonemic alternations that discriminate between #sC and rising clusters. 
Perhaps the best-known illustration comes from the behaviour of proclitics in 
Italian, which mould themselves to the initial syllable structure of a following 
stem (Davis 1990; Bertinetto 2004). The alternants occurring before #sC differ 
from those occurring before singleton consonants or rising clusters and more 
closely resemble those occurring before vowels, for example, [il se:no] ‘the 
mark’, [il tre:no] ‘the train’ versus [lo spa:ro] ‘the shot’, [1 amiko] ‘the friend 
(m.)’. Since the vowel-initial environment is a syllable rime, a reasonable 
conclusion is that the [s] of #sC also falls within a rime (Davis 1990; Kaye 
1996). That is to say, the [s] of #sC occupies a coda at the beginning of 
the stem. Similar syllabically sensitive alternations and distributions that 
segregate #sC from rising clusters are to be found in European Portuguese 
(Cavaco Miguel 1993) and, as we will see below, Cypriot Greek. 

A second source of evidence indicating that #sC is syllabically unlike 
rising clusters comes from articulographic studies of speech production. 
The results of one such study of Italian corroborate the morphophonemic 
evidence just reviewed. Using electromagnetic midsagittal articulography, 
Hermes et al. (2013) investigated the articulatory phasing of word-initial 
consonants relative to a following vowel. Their results confirm previous 
findings showing that rising clusters are coordinated as a unit — the C-centre 
effect. With #sC clusters, however, they found no C-centre effect: the 
presence or absence of an initial [s] (for example, spina versus pina) makes 
no difference to the timing of the C-centre relative to the vowel. Hermes 
et al. interpret this as reflecting different syllabications: rising clusters form 
complex onsets, while #sC clusters do not. 

A third source of evidence against syllabifying #sC like rising clusters is 
to be found in the process of consonant-cluster simplification that occurs in the 
early stages of first language acquisition. The process typically acts differently 
on the two sorts of sequence: in rising clusters it is the second consonant that 
is dropped (for example, [pej] ‘play’), whereas in #sC it is the second ([puwn] 
‘spoon’) (see McLeod et al. 2001 for a review of the extensive literature). 
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All of the evidence just reviewed suggests a negative answer to the 
question of how #sC is syllabified: not as a complex onset. A positive answer 
is suggested by the Italian morphophonemic evidence: #sC syllabifies as a 
coda plus an onset. A comparison of different Romance languages provides 
us with further insights into this syllabification through the presence versus 
absence of a vowel preceding historical #sC (cf. Spanish escuela versus Italian 
scuola). 

Other evidence suggests that coda-onset is not the only way #sC can be 
syllabified. The other alternative is where #sC results from the devoicing and 
eventual total syncope of a vowel between two consonants, as in Japanese 
[sukida] > [skida] ‘I like it’ (Tsujimura 2007) or Sesotho [srtulo] > [stulo] 
‘chair’. A reasonable analysis of this scenario is to say that the two consonants 
remain in different onsets separated by a silent nucleus. 

Since these two ways of syllabifying #sC are independent of one another, 
there is nothing to prevent them both from showing up in the same language. 
As Monik Charette has shown (1991, 1992), this is indeed the case in French, 
where words with etymological #sC (for example, sport, store) exist alongside 
words where #sC is created by syncope of an intervening schwa (for example, 
[s(a)riz] cerise, [s(a)mel] semelle). A similar situation prevails in some varieties 
of European Portuguese (Cavaco Miguel 1993; Kaye 1996). 

To summarise, the evidence reviewed above suggests word-initial 
consonant clusters come in three different syllabic configurations: (a) complex 
onset, (b) coda plus onset, and (c) onset plus silent nucleus plus onset. 
Configuration (a) is limited to rising clusters, while #sC can syllabify as 
either (b) or (c). We have expressed this three-way distinction in terms of a 
rather conventional notion of syllabic constituency. There are other ways of 
capturing the same three-way distinction, for example in terms of different 
dependency or licensing relations between segmental positions (for example, 
Lowenstamm 1996; 1999; Scheer 2004; Ségéral and Scheer 2008; Chapter 
23). However we conceptualise it, what is important here is that, according to 
the evidence reviewed above, there is a syllabification difference between #sC 
and rising clusters. It is this difference that we wish to investigate further by 
seeing whether it also shows up in speech perception. 


Is #SC special? 


One of the questions addressed by the dichotic listening study described below 
is whether there is something uniquely special about sibilant fricatives that 
allows them to form the only non-rising initial clusters in many languages. 
There are good grounds for viewing sibilance as bestowing an auditory 
perceptual privilege on a consonant that occurs word-initially before 
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another consonant, particularly when the latter is a stop. Compared to rising 
clusters consisting of an obstruent followed by an approximant, falling 
clusters consisting of any consonant followed by a stop are in general cross- 
linguistically disfavoured in initial position. The most plausible explanation 
for this starts from the observation that the closure phase of the stop inhibits 
transitional acoustic cues to the identity of a preceding consonant (Ohala 
1992; Steriade 1999; Wright 2004; Henke et al. 2012). If an initial consonant 
is to survive in this position, it is more likely to be identified by listeners if 
it provides them with segment-internal cues, such as we find in continuants 
but not in stops. Among continuants one of the most robust internal cues is 
frication noise. And among fricatives the most robustly cued are those where 
the intensity of this noise is greatest, viz. sibilants. 

The relative robustness of auditory-acoustic cues to consonants before 
stops suggests an implicational scale of phonotactic preferences governing 
clusters in initial position (Steriade 1999; Wright 2004; Henke et al. 2012). If 
a language that places phonotactic restrictions on initial clusters allows not just 
sibilants to precede a stop, we can expect the next best candidates to be nonsibilant 
fricatives. This is exactly the situation in Cypriot Greek. The language allows us 
to investigate whether initial clusters beginning with nonsibilant fricatives show 
the same anomalous syllabification behaviour as #sC. 


Word-initial clusters in Cypriot Greek 


Like all Indo-European languages, Cypriot Greek has a set of rising 
word-initial clusters consisting of a stop or fricative followed by a liquid, 
for example, [tria] ‘three’, [kréas] ‘meat’, [platis] ‘wide (m.)’, [kléftis] 
‘thief’. Like some Indo-European languages, it also has initial #sC, for 
example, [skafto] ‘I dig’, [spi0ca] ‘houses’. In addition to #sC, it has a 
limited set of #fC sequences — falling clusters where the first consonant 
is a nonsibilant fricative, for example, [ftoxds] ‘poor’, [fk6] ‘to go out’, 
[xtiz:o] ‘I built’, [0cd] ‘two’. These combinations are summarised in 
Table 10.1. 


Table 10.1 Word-initial consonant clusters in Cypriot Greek. 


Sonority Cl C2 Examples 

Rising Stop Liquid [pl, pr, tr, kl, ke] 
Rising Nonsibilant fricative Liquid [fl, fr, 61, Or, xl, xc] 
Falling Sibilant Stop [sp, st, sk] 

Falling Nonsibilant fricative Stop [ft, fc, Oc, xt] 
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The fact that the set of #fC clusters in Cypriot Greek is rather small is 
important in the context of the study to be presented below. Although 
Cypriot Greek has a rather wider range of initial clusters than, say, English 
or Italian, they are still subject to quite strict phonotactic constraints. This 
means that Cypriot Greek is not an ‘anything goes’ language (Dumercy 
et al. 2013), like Polish or Czech for example, where word-initial clusters 
are largely unfettered by sonority-based phonotactic restrictions (Gussmann 
2007; Orzechowska 2019). Dichotic listening experiments have shown that 
Czech speakers can fuse more or less any pair of initial consonants, regardless 
of sonority combinations (Dumercy et al. 2013). Why Cypriot Greek is of 
interest to us is that it provides a useful testbed for investigating whether 
sibilance is uniquely responsible for the syllabically anomalous behaviour of 
#sC. Dichotic listening studies indicate that in English, with its fairly tight 
phonotactic restrictions on initial clusters, #sC is less readily fusible than 
rising sequences (Chiu et al. 2016). With Cypriot Greek, we can test not only 
whether this difference is replicated but also whether it carries over to #fC. If 
it does, it points to [s] not being particularly special. 

An initial indication that #fC and #sC are syllabically alike in Cypriot 
Greek is provided by the morphophonemics of proclitics, which alternate 
according to the phonological shape of the stems they attach to. This is 
illustrated by the copula [en] in (1). The basic alternant occurs before a vowel- 
initial stem, as in (la). Before a singleton consonant or a rising cluster, we 
see partial or total assimilation of the copula-final consonant to the initial 
consonant of the stem, as in (1b) and (1c). Before a stem beginning with #sC, 
the copula-final consonant deletes, as in (1d); the same thing happens before 
#fC, as in (le). 


(1) a. [en arostos] ‘he is sick’ [en omón] “it is raw’ 
b. [em makris] ‘he is long’ [es sazméni] ‘it is fixed (fem.)’ 
c. [ev vreménos] ‘he is wet’ [en glaméni] ‘she is tearful’ 
d. [e spazménon] ‘it is broken’ [e skotoménos] ‘he is killed’ 
e. [e ftands] ‘he is silly/thin? [e xtarménon] ‘it is scratched’ 


These alternations are strongly reminiscent of those involving Italian proclitics. 
Since the latter are generally agreed to be driven by syllable structure, there is 
good reason to assume those in Cypriot Greek are too. 

Syllabic sensitivity explains the deletion of the copula-final [n] by 
linking it to the fact that the coda in Cypriot Greek contains at most one 
consonant. The [n] survives before a vowel because it can take up residence 
in an available onset, as in (2a). Before an onset, the occupies the preceding 
coda, where it undergoes assimilation, as in (2b) and (2c). 
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(2) a. [en-omdn] b. [en-makris]>[emmakris] c. [en—veeménos]>[evvreménos] 


o 6 o o o o 
XXXX. x XXX XXXXX 
enomon e makris e vremenos 


According to some analyses, all word-initial consonant clusters in Cypriot Greek 
syllabify as complex onsets, regardless of their sonority profile (for example, 
Coutsougera 2003; Armosti 2009). However, the morphophonemic evidence 
illustrated in (1) strongly suggests that only rising clusters syllabify this way; 
as shown in (2c), these behave just like singleton onsets exemplified in (2b). In 
contrast, if we assume that #sC syllabifies as a coda plus onset, as in Italian, we 
can explain why the n fails to survive in this environment: there is no syllabic 
position available to host it. As shown in (3a), the coda is already occupied by s. 
The same analysis can evidently be applied to #fC, as shown in (3b). 


(3) a. [en-spazménon] > [espazménon] b. [en-ftanós] > [eftands] 
fo} o o j 
X XXX... X XXX... 
easpazmenon eaftanos 


Although the analysis in (3) departs from the view that all initial clusters in 
Cypriot Greek are complex onsets, it nevertheless incorporates an insight 
provided by Armosti (2009) that some initial consonants can be moraic.? 
Under the analysis in (3), this is achieved through weight by position: being in 
a coda entails that the first consonant of a falling cluster bears weight. 

Based on their morphophonemic behaviour, we can draw two conclusions 
about the syllabification of #sC and #fC in Cypriot Greek: they syllabify the 
same way irrespective of sibilance, and they are not syllabified like complex 
onsets. These facts inform our choice of stimuli in the dichotic listening study 
to be presented below. 


Dichotic listening 


The use of dichotic listening as a method for studying the syllabification of 
consonant clusters is originally due to Cutting and Day (for example, Cutting 
1973; 1975; Cutting and Day 1972; 1975). The basic method involves simul- 
taneously presenting listeners with different words to each ear, with the words 
differing only in one consonant. For example, the listener hears a word with 
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an initial stop in one ear (for example, pay) and a word with an initial liquid 
in the other ear (for example, /ay). Listeners then report what they hear. The 
results of such experiments show two noteworthy effects. Firstly, listeners 
sometimes report fusion: two simultaneously presented words are heard as 
a single percept (here play). Secondly, at least with English listeners, fusion 
typically favours a rising sonority sequence (that is, play rather than *[Ipej]). 
Cutting and Day place a phonological interpretation on these effects: fusion 
indicates that listeners parse the two consonants as part of the same syllable 
onset, and sequencing reflects language-specific phonotactic constraints on 
consonant clusters. 

Chiu et al. (2016) found that, when English listeners are dichotically 
presented with different singleton consonants in varying sonority combina- 
tions, they readily fuse and sequence stop and liquid pairs as rising-sonority 
clusters, as in Cutting (1975), Cutting and Day (1975). In contrast, listeners do 
not readily fuse pairs where one consonant is [s], thus corroborating evidence 
that #sC is not a complex onset. 


Dichotic listening study of Cypriot Greek 


The aim of the dichotic listening study to be presented here is to discover 
whether speakers of Cypriot Greek perceive #sC clusters any differently than 
#fC and initial rising clusters. 


Method 


Forty-five native speakers of Cypriot Greek completed the study (27 female, 
age range 19-35, mean age 22;8 years).? All were born and raised in Cyprus. 
There were no self-reports of hearing problems or speech and language 
disorders. 

The selection of stimuli for the experiment was guided by the morpho- 
phonemic patterning described above. The pairs of consonants presented to 
listeners fall into the four major-class divisions shown in Table 10.2.4 Each 
consonant was placed in a nonword consisting of the initial target position 
followed by a _ VCV tail, for example, kovi, lovi, sovi. The tail was held 
constant except for one factor: the vowel following the target consonant was 
varied between [e] and [o]. Having a contrast between front and back vowels 
in this position is necessary to accommodate phonologically conditioned 
variation between palatal and velar realisations of dorsal consonants in 
Cypriot Greek. Members of each stimulus pair always contained the same 
vowel. 
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Table 10.2 Consonant pairs used in the dichotic listening study. 


Major Classes Pairs 

Stop|Liquid pll, tle, kļe 
NonsibilantFricative|Liquid fl, Ol, x|c 
Sibilant|Stop s|p, s|t, s|k 
NonsibilantFricative|Stop fik, O|k, x|t 


The intention behind using nonwords was to stress-test fusion, as 
previous dichotic listening studies have shown that listeners fuse consonant 
pairs more readily when the fused percept is an existing word (Day 1968, 
reported in Cutting 1973). The decision to use a disyllabic CVCV word 
frame was motivated by two factors. Firstly, disyllables are by far the 
most frequent word size in the Cypriot Greek lexicon (Themistocleous 
et al. 2011). Secondly, working with disyllables rather than monosyllables 
greatly increases the area of usable lexical space not already occupied by 
existing words. Having a singleton consonant in the non-target second 
syllable was considered preferable to having a cluster, since the latter could 
potentially interfere with the fusibility of target consonant pairs in the initial 
syllable. 

The stimuli were recorded in a sound-proof studio by a phoneti- 
cally trained female native speaker of Cypriot Greek. The experiment 
was conducted online using Qualtrics survey software." Participants, after 
confirming they were wearing headphones, were presented with the auditory 
stimuli. On each trial, participants were simultaneously presented with one 
nonword to each ear, with the members of each nonword pair differing only 
in the initial consonant, for example, povi, lovi. The nonword pairs were 
drawn from the set of options defined by the major-class permutations listed 
in Table 10.2. Participants were presented with four trials per nonword pair, 
counterbalanced for left versus right ear (for example, povi twice in one ear, 
twice in the other), to control for right-ear dominance in the perception of 
verbal stimuli (Kimura 1961). 

Immediately after being presented with each stimulus pair, participants 
indicated which of four alternatives they had heard. For example, after hearing 
povi in one ear and lovi in the other, they were given the choices povi, lovi, 
plovi, or Ipovi (see Table 10.3). The alternatives were presented visually on 
a computer screen in Roman spelling. These forced-choice response options 
were designed to capture (a) whether the listener heard a fused percept (plovi 
or lpovi) or not (povi or lovi) and, if so, (b) the order in which the target 
consonants were perceived (plovi versus /povi). 
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Table 10.3 Possible responses to dichotically presented nonword stimuli. 


Possible responses Examples 
Rising CC plovi 
Fused Cluster Falling CC jnovi 
Unfused Singleton Lower sonority C povi 
Higher sonority C lovi 
Results 


For reasons of space, we will limit our focus here to the question of whether 
the major-class affiliation of target consonant pairs influences participants’ 
responses to the input stimuli. We analysed responses for two possible biases. 
Firstly, is there a bias towards hearing stimulus pairs as fused (for example, 
plovi or lpovi) or not (for example, povi and lovi)? Secondly, where there is 
a preference for fusion, is one sonority sequence preferred over the other (for 
example, rising in plovi versus falling in /povi). 

Figure 10.1 groups the responses according to the major-class affiliation 
of the target consonant pairs and shows the extent to which each class 
combination is perceived as fused. The percentage values given in Figure 10.1 
were obtained by averaging across the three consonant pairs in each of the 
major-class combinations shown in Table 10.2, and across the four trials per 
stimulus pair per participant. 


1007 
80- 
60- 


% Fused 


40- 


Stop|Liquid Nonsib. fric.|Liquid Sib.|Stop Nonsib.fric.|Stop 


Figure 10.1 Cypriot Greek dichotic listening: participants’ preference for fused 
percepts (%Fused) classified by the major class of pairs of input consonants. 
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Does major class influence the likelihood of input consonant pairs to 
fuse? An analysis of variance of percentage fused values establishes that 
this is indeed the case: F(3, 176) = 16.904, p < 0.001, np? = 0.224. 
We followed up with paired comparisons to investigate which particular 
major-class combinations favour fusion more than others. Two results are of 
immediate relevance to the research questions we started out with. Firstly, 
there is no significant difference in fusion rates between Sibilant|Stop 
(M = 28.333%, SD = 35.060%) and NonsibilantFricative|Stop pairs 
(M = 29.444%, SD = 27.957%), p = 0.771, n.s. Secondly, neither of 
these fricative-stop combinations fuse anything like as much as the two 
obstruent-liquid combinations. Sibilant|Stop pairs fuse less readily than 
Stop|Liquid pairs (M = 52.963%, SD = 28.044%), #(44) = —4.993, p < 0.001. 
Sibilant|Stop pairs also fuse less readily than NonsibilantFricative|Liquid 
pairs (M = 65.370%, SD = 27.059%), t(44) = —7.245, p < 0.001. Similarly, 
NonSibilantFricative|Stop pairs fuse less readily than Stop|Liquid pairs, 
(44) = —5.4935, p < 0.001. NonSibilantFricative|Stop pairs also fuse less 
readily than NonsibilantFricative|Liquid pairs, t(44) = —8.531, p < 0.001.’ 

Now let us focus on fused responses, to see whether major class biases the 
way the input consonants are sequenced. To this end we examined the two classes 
that fuse most readily, namely Stop|Liquid pairs and NonsibilantFricative|Liquid 
pairs (Figure 10.2).8 In both cases, there is a clear preference for fusion into 
rising rather than falling clusters. This is confirmed by comparing percentages 
of rising versus falling responses. Fused Stop|Liquid pairs show signifi- 
cantly more rises (M = 92.898%, SD = 13.999%) than falls (M = 7.1023%, 
SD = 13.999%), #41) = 13.0999, p < 0.001. The same goes for fused 
NonsibilantFricative|Liquid pairs: more rises (M = 84.941%, SD = 17.286%) 
than falls (M = 15.059%, SD = 17.286%), (41) = 19.861, p < 0.001. 

Tosumup, participants fused Stop|Liquid and NonsibilantFricative|Liquid 
input pairs more readily than Sibilant|Stop and NonsibilantFricative|Stop 
pairs. Both Stop|Liquid and NonsibilantFricative|Liquid pairs were more 
readily fused as rising clusters than falling. A low rate of fusion was equally 
evident in Sibilant|Stop and NonsibilantFricative|Stop pairs. 


Discussion 


If listeners in this study were fusing input consonant pairs on the basis of what 
is legal as a word-initial (as opposed to syllable-initial) cluster in Cypriot Greek, 
we would expect no biases based on major class. This is because all four of 
the major-class combinations included in the study are attested in word-initial 
position (see Table 10.1). However, we do see major-class biases, and pretty clear 
ones at that, which suggests that something else is influencing the propensity 
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Figure 10.2 Cypriot Greek dichotic listening: participants’ preference for the 
sonority sequencing (Rise versus Fall) of fused input consonant pairs consisting 
of the two major-class combinations most susceptible to fusion, Stop|Liquid and 
NonsibiliantFricative|Liquid. 


to fuse. We believe the most plausible explanation is the one established in the 
earliest dichotic listening studies of Cutting and Day: listeners fuse on the basis 
of what is legal as a syllable-initial (as opposed to word-initial) cluster. 

The intervention of syllable phonotactics would explain the 
difference in fusibility between obstruent-liquid pairs (Stop|Liquid and 
NonsibilantFricative|Liquid) on the one hand and fricative-stop pairs 
(Sibilant|Stop and NonsibilantFricative|Stop) on the other. Obstruent-liquid 
pairs are more readily fusible into rising clusters because they make good 
complex onsets. Fricative-stop pairs, in contrast, are less readily fusible 
because they do not make good complex onsets. 

The relative resistance to fusion of fricative-stop pairs holds of both 
NonsibilantFricative|Stop pairs (for example, [x|t]) and Sibilant|Stop pairs (for 
example, [s|t]). This suggests that they have the same syllabification, which is 
consistent with the morphophonemic evidence reviewed Word-initial clusters 
in Cypriot Greek above. In other words, from the viewpoint of syllabification, 
#sC is not unique, at least in Cypriot Greek. 


Conclusion 


We started out here by reviewing various pieces of evidence that cast doubt on 
the once standard assumption that #sC is syllabified just like initial clusters of 
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rising sonority, that is, as a complex onset. The evidence comes from morpho- 
phonemic patterning, speech production, and early phonological development. 
To this we can now start to add evidence from speech perception. The dichotic 
listening study reported on here corroborates the conclusion that #sC is not 
syllabified as a complex onset. 

If #sC is not a complex onset, what is it? A limitation of our study is 
that it does not allow us to take the extra step of answering this question. At 
least for some languages, including Cypriot Greek itself, the morphopho- 
nemic evidence suggests #sC syllabifies as coda plus onset (however we wish 
to represent that). A rather different implementation of dichotic-listening 
methodology would be required to investigate whether this syllabification 
reveals itself in speech perception. 

What the present study does give us, however, is yet another reason 
for distancing ourselves from the traditional view that any consonant cluster 
finding itself at the beginning of a word necessarily also finds itself at the 
beginning of a syllable. 


Notes 


1. Special thanks to Spyros Armostis for his much-valued comments and advice as well as for his 
practical help with the design and running of the Cypriot Greek experiment reported on here. Thanks 
also to Constantina Gregoriades and Gisé Tomé Lourido. 

2. Armosti’s (2009) focus is on initial geminates (which were not included in our study). However, he 
does note that these pattern morphophonemically with initial falling clusters. 

3. The results from 16 other participants were discarded because they failed to complete the experiment. 
UCL Research Ethics Committee number LING-2013-11-12. 

4. The stimulus set also included sibilant plus sonorant pairs (for example, [s|n], [s|l]), which we will not 
report on here. Cypriot Greek limits initial sibilant plus sonorant clusters to [zm] (sibilant plus [n] or 
[I] occur in a few sporadic loan words). These pairs were included only for the sake of balancing the 
stimulus set against one being used for a parallel dichotic study of English (where [s] plus sonorant 
clusters are legal, for example, snap, slide). 

5. https:/Avww.qualtrics.com. 

6. The decision to use Roman rather than Greek writing was taken after consultation with our Cypriot 
Greek colleagues. While Greek writing is used for standard Greek, young Greek Cypriots typically use 
Roman spelling for writing Cypriot Greek, for example on social media. 

7. These pair-wise comparisons remain significant when corrected for multiple comparisons. 

8. Three participants, who reported no fusion of Stop|Liquid pairs, were removed from the sequencing 
analyses. 
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11 
The segholate verbs of English 
Jean Lowenstamm 


Introduction 


Based on joint consideration of their Present, Past, and Past Participle forms, 
English verbs are traditionally divided into two groups of unequal size.! 
A few form their past tense and past participle by means of Ablaut (for 
example, sing/sang/sung). Those are the ‘strong’ verbs. ‘Weak’ verbs, the 
overwhelming majority, merely suffix +ed to the stem (hoped, debunked, 
trespassed, etc.). However, a third group of some 60 verbs stands out on 
account of two negative characteristics: the verbs of that group show neither 
Ablaut nor +ed. On the other hand, their pasts and past participles always 
involve a final coronal obstruent whether one shows in the present or not 
(shot, hid, felt, rid, kept, etc.).? Because that ubiquitous final coronal is 
reminiscent of ted, and because of the absence of Ablaut, those verbs are 
viewed as a special pocket of weak verbs. They are called ‘irregular’ weak 
verbs (henceforth IWV). 

Much valuable work has been devoted to that group though usually 
in the context of more encompassing discussions of length alternations in 
English, cf. Kiparsky (1982), Myers (1987), Dresher and Lahiri (1991), 
Minkova and Stockwell (1996), Lahiri and Dresher (1999), Lahiri and Fikkert 
(1999) and references therein, to mention just a few. 

It is fair to say that IWV are viewed as a kind of morass, a historical 
residue, moreover one of modest size, and therefore of limited significance. 
I intend to argue, on the contrary, that they are part of a large and organised 
set enjoying a place of its own in the synchronic grammar of English.* Against 
the background of a comparison with the class of Semitic nouns known as 
segholates, I will argue that the irregular weak verbs of English are templatic.* 
In the sections that follow this introduction I will list characteristics of [WV 
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as well as some of the questions I will attempt to answer; provide a brief 
introduction to segholate nouns; and lay out fairly standard assumptions about 
the construction of English verbs along with a proposal of my own. After 
this, I implement my proposal in several steps, followed by a section in which 
I discuss a small class of exceptions and the intriguing voicelessness of their 
final coronal (built, bent, etc.). 


Some characteristics of IWV and related questions 


IWV are all ‘short’. Indeed, ‘longer’ verbs do not replicate their behaviour. 
Thus, Ait is the past of hit, but *solicit could never be the past of solicit; hid 
is the past of hide, but *subsid could never be the past of subside, etc. The 
corresponding questions appear in (1). 


(1) a. How is ‘short’ defined? 
b. Why are IWV ‘short’? 


One of the few solid generalisations about IWV is the exceptionless brevity 
of their past tense vowel in contrast with the free distribution of length in the 
present (keep/kept, read/read, cut/cut). The question in (2) therefore arises. 


(2) Why is vowel length not free in past tense forms? 
With respect to the final coronal present in all pasts, Myers (1987, 492) writes: 


(3) There are two past tense suffixes in English, both of which happen 
to be coronal stops; —t as in burnt, and —d as in faded. The former is 
restricted to the root level, while the latter can be either root level or 
word level. 


There is something highly intriguing with (3). Not only are the two supposedly 
distinct exponents of past tense almost identical, but their distributions 
overlap. Hence the question in (4). 


(4) Are there really two past tense suffixes? And they merely happen to be 
coronal? Or have we maybe missed something? 


A survey of a list of IWV reveals a massive gap: their roots exclusively end 


in p (keep), t (hit, bite), d (rid, hide), m (dream), n (lean), l (feel) and, in two 
exceptional cases, v (leave, bereave). Two questions arise: 
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(5) a. Does something in the grammar of English make it impossible for 
a verb whose root ends in, say, b, g, z, f, k, s, or š to be an IWV? 
b. Is the definition of IWV sound, to begin with? 


In the next section, I introduce relevant aspects of the segholate nouns of 
Arabic. 


Segholates 


The typical surface profile of segholate nouns is CVCC. Examples are given 
in (6). 


(6) a. b. ë: 
bahr ‘sea? diyn ‘religion’? hubb ‘love’ 
bint ‘gir’ Cuwd ‘lute’ sadd ‘dam’ 


Their ingredients — root and vocalic melody — are mapped onto a template as 
shown in (7). The root in (7a) inasmuch as it contains no glide is known as 
‘sane’. The root in (7b) with its medial glide is dubbed ‘hollow’. Finally, the 
biliteral root in (7c) is called ‘deaf’. The latter spreads rightward according to the 
scheme laid out in McCarthy (1979). A low-level rule affecting all segholates 
from hollow roots will map diyn and {uwd into [din] and [Sid], respectively. 


i u 


a 
| 
CVCVCV CVCVCV CVCVCV 
ae Ai \ MV 
HR VD YN VHB 
[bahr] [din] [hubb] 


Given the types of roots in (7) and the makeup of the template, the above 
configurations exhaust the set of logical possibilities. 

The point of a comparison with Arabic segholates is this: Germanic 
roots, as Jacob Grimm had discovered (Grimm 1878), reproduce the same 
array: CVCC; (like bahr), CVC (like din), and CVC, (like hubb). Does the 
presence of that array in Germanic also reflect the presence of a template? 
Following Bendjaballah (2012, 2014) on German, I will argue that the IWV 
system, too, is template-based. 
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Templatic versus non-templatic English verbs 


Following many, I make two assumptions. First, I assume that verbs arise as 
the result of the selection of a root by categorising head v, [p V VP]. Second, 
I assume that verbs move up to Tense and left-adjoin to the head of that 
constituent. Depending on the content of Tense, verbs will be Present or Past. If 
Present, the verb comes out unsuffixed, if Past, +d (henceforth Dp,s) is added. 

My proposal is that English v comes in two versions.° One is the 
‘plain’ v; it merely makes it possible for a root to play a role in the syntactic 
computation. The other version of v has the same effect, but it specifies 
in addition a condition on the material realisation of the verb: its phonetic 
ingredients will have to be realised within the confines of the space defined by 
a verbal template, [, C V C V C V], once more the same as for the segholate 
nouns of Arabic. 

In (8), I show the representation of four verb forms, hope, hoped, keep, 
and kept at a point where they have left-adjoined to the head of Tense and 
Tense has released the exponent of its value, ø if Present, Dp, if Past. VHOPE 


has been selected by ‘plain’ v, and D,,., therefore attaches as a suffix (8b). 


Past 
VIP, by contrast, has been selected by ‘templatic’ v. In (8c), the root deploys 
over the template as indicated. In this case, the surface result unremarkably 
parallels that of (8a), viz. [CVC]. But in the Past (8d), Dpast 
inside the template, much as if an augmented root — VKIP+Dpact — was now 
involved. In this case, the increased segmental load of the root thwarts the full 
deployment of the underlying long vowel and shortening follows,’ [kept] vs. 
[kip] as opposed to [ho’p]/[ho’pt] with no shortening.’ 


must be realised 


(8) a b. 
TP TP 
vP T present vP Tpast 
J/\ 4+ AA 4 
v VHOP ø v VHOP Dhs 
Ç, d. 
JAN TP 
Ha vP T 
h [iy Dpast 
CVČ \ CV] [CVCV l v] 
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We now have answers to (la): the template defines ‘short’; (1b) ‘long’ roots 
(solicit, subsidise, etc.) will never pattern as IWV do because their size 
exceeds the capacity of the template. Their sole access to verbhood will be 
through selection by ‘plain’ v. Less trivial is another consequence, namely 
the fact that one and the same ‘short’ root can be selected by either version 
of v, thus making it possible for an irregular weak verb to also have a regular 
doublet, viz. dreamt/dreamed, knelt/kneeled, etc. Note moreover that there is 
nothing necessary in the fact that keep is templatic and hope non-templatic. 
Indeed, VKip might as well have been selected by ‘plain’ v, thus giving rise 
to [kiYpt], or by both ‘plain’ v and ‘templatic’ v giving rise to a [kept]/[ki’pt] 
doublet. Similarly, hypothetical [hopt] or the hypothetical doublet [hopt]/ 
[ho’pt] would have been as well-formed as the attested [hopt].° 

This account can be compared to an earlier one capitalising on the 
distinction made available by Lexical Phonology between Level 1 and Level 
2 suffixation. It runs as follows: the last consonant of keep is extrametrical, 
kee(p); the syllable of the stem therefore counts as open. When ¢ attaches 
at Level 1, £ now counts as extrametrical (keep(t)), and Closed Syllable 
Shortening must apply, yielding kept. On the other hand, hoped has a case 
of Level 2 attachment, and so no shortening takes place. The problem is that 
Level 1 attachment would have to be restricted to ‘short’ verbs inasmuch as 
‘longer’ verbs, for example, elope, never show any shortening in the past 
(*[elopt]). In Lexical Phonology, as far as I know, this coincidence between 
Level | attachment and shortness has to be fortuitous. By contrast, it follows 
from my proposal that vowel shortening will take place only where the verb 
is templatic. 

Thirteen past tense verbs, listed in (9), are derived in the manner 
described in (8d). 


(9) (bereft, crept,'° dealt, felt, heard, kept, knelt, left, lost, meant, slept, 
swept, wept; and three antiquated or British forms: dreamt, leant, leapt. 


More verbs 


The read/read group 


A second group of verbs is exemplified by read, [riyd/red]. That second group 
shares something with the first group, namely the vowel is long in the present 
tense, [ri¥d], but short in the past. But, whereas the shortening in kept followed 
from the extra load of consonantal material incurred by the presence of D,,,,,, 
the past tense of read shows no such increase. Why does the vowel shorten 


nevertheless? This is my next point. 
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In (10a), I have given the representation of Present read. It is comparable 
to (8c). In (10b), the augmented root deploys over the template positions, thereby 


forcing the brevity of the vowel. The internal realisation of D ast Will never allow 


ast 
an underlying long vowel to survive in the past. This answers the question in (2). 
In (10c), the OCP conflates radical D and D, 


pronounced as a simplex segment. The claim here, in other words, is that English 


asr The geminate thus created is 
has virtual geminates in the sense of the argument developed in Lowenstamm 
(1996) about germane verbal evidence involving the Danish stöð.!! 


(10) a. b. c. 
CVCVCV CVCVcCV CVCVCV 
W y YY 
RID RID+D RID 


l 
[redd] — red? !? 


Past 


The fourteen past tense verbs derived in the manner just described are listed 
in (11). 


(11) sped, slid, chid (archaic), plead, shot, read, met, lit, lead, hid, fed, bred, 
bit, bled. 


Note that the roots of all verbs in the group just reviewed end in ¢ or d. None 
of the roots of the verbs reviewed in the preceding group, (9), did. This 
complementary distribution, obscured by the fact that English geminates 
remain unpronounced and are inconsistently acknowledged by the spelling 
conventions in word-final position, clearly shows how that second group is 
just a special case of the first. For the sake of clarity, I will benchmark such 
inclusion relations in the format of (12). 


(12) Root type CVC 
Coronal-final roots read, meet 
Other roots keep, deal 


A prediction is now within reach. It is the topic of the next section. 


The cut/cut, rid/rid group 


If a configuration such as (10c) is recognised, it cannot be confined to Past 
Tense verbs. Rather, because the template is the same in Present and Past, it 
should be observable in Present Tense verbs as well. If that is correct, there 
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should be a group of verbs of type CVd(d) or CVt(t) in which the Present and 
Past are phonetically identical. That group indeed exists. It can be exemplified 
by means of verb rid. 


(13) a. b. c. 
CVCVCV evcevey  CVCVCV 


W \I/ / WV 


RID RID+D pyst RID 


In (13a), I show how the present of rid is formed. Spreading is required 
by the imperative of template saturation. There is a positive reason for 
supposing that Present rid involves a geminate. It rests on an extension of 
Hammond’s point: if a geminate is present in ridding and ridden, there must be 
one in their stem, too. The past of rid in (13b) involving the augmented root, no 
spreading of the root-final consonant is necessary in this case. The OCP kicks 
in, (13c), and the same form, [rıdd], eventually surfaces for both the Present 
and the Past. The nineteen verbs patterning in that fashion appear in (14). 


(14) bet, bid, cut, fit, hit, knit, let, put, quit, rid, set, shed, shit, shut, slit, spit, 
spread, wed, wet. 


Now is the time to try and answer the question in (4). If we place the verbs of 
(14) in the benchmarking frame, we can easily fill in the root type box: by the 
account just proposed, the roots of those verbs are of type CVC,C, (like Arabic 
hubb). But what sits in the ‘Other roots’ box? That is, of what larger group are 
the verbs in (14) the coronal-final subtype? 


(15) Root type CVCC, 
Coronal-final roots cut, rid 
Other roots ? 


At this point, spelling must be briefly discussed because it is, in reality, 
the sole basis on which a verb is deemed regular or irregular. Thus, if the 
Past Tense forms of dab, hug, buzz could be written dabd, hugd, buzd, 
they would be viewed as bona fide IWV. As it is, English spelling does 
not sanction final clusters such as bd#, gd#, zd# and those pasts can only 
be written dabbed, hugged, buzzed, which makes them look regular. ft# 
clusters are possible in English but the Past of verbs such as puff or sniff is 
never written puft or snift, rather puffed and sniffed. ckt# is not sanctioned 
by English spelling and therefore [hakt] can only be written hacked. Just 
like graphic zz-, and ff-final verbs, ss-final verbs never simplify under any 
circumstances. Thus, [hist] for instance will have to be written hissed. 
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Except in loanwords, sht# is not a possible final graphic sequence and [lt] 
will have to spell /ashed. Finally, even though pt# is graphically possible, 
[kleept] which used to be spelled clapt must now be spelled clapped, as do 
most other p-final verbs. 

Evidently, those very numerous monosyllabic roots with a short vowel, 
which I represent — as in (16a,c) — as would be done for Semitic words from 
biliteral roots, are tokens of the general pattern of which rid, cut, etc., are 
the special, coronal-final subpattern. Because the OCP and the silence of 
geminates conspire with English spelling to obscure things, I have brought out 
the formal parallelism between dabbed (16b) and rid (16d) with the hourglass 
(Z) between the coronal obstruents in (16d) indicating that the operation of the 
OCP is imminent but hasn’t taken place, yet. 


(16) a. b. CG; d. 
CVCVCV CVCVCV CVCVCV CVCVCV 
WY OOS 7y 
DAB DAB+D pact RID RÍDŠD pas 
dab, /dæbb/ dabbed, /dæbd/ rid, /ridd/ rid, /ridd/ 


Squatters and why they devoice 


As pointed out, for a verb to be involved in the processes I have described, 
its makeup must be such that the augmented version of its root does not 
exceed the capacity of the template. Clearly, a CVCC, root, a root consisting 
of three different consonants, for example, curb, will fit into the template on 
its own, but not what I have called its ‘augmented’ or ‘past’ version. In such 


cases, D,,., will have to be realised extratemplatically: curbed, etc. 


Past 
Can any such verb stand a chance to force in its augmented root, never- 
theless? Here, a two-faceted prediction is made. If a CVC; root is coronal- 


final, its last segment will allow Dp,,, to dock and coalesce. Such is the case 


Past 
of build, cost, as shown in (17). The second facet of the prediction is of 
course that this can only happen if the verb is templatic. If the root underlying 
the verb cannot fit the template as, for example, manifest, the past will have to 


be regular, that is, manifested/*manifest. 


(17) a. b. 
CVCVCYV CVCVCV 


LT 11/7 T 


COST+ D past BILD+ D 
cost built 


Past 
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Such verbs are: 
(18) bent, built, burnt, cast, cost, hurt, learnt, lent, sent, spent, thrust. 


In this case, the content of the benchmarking frame records the fact that the 
only possible IWV from CVC,C, roots must be coronal-final. 


(19) Root type CVCC; 
Coronal-final roots build, cost 
Other roots none 


The truly interesting feature of those verbs is the voicelessness of D p,,, which 
systematically accompanies coalescence. This is where I attempt to answer the 
question in (3) and refute the notion that English has two distinct Past Tense 
suffixes, both of which accidentally happen to be coronal. 


On the view depicted in (17a), the voicelessness of Dp, in cost, and 


Past 
cast, hurt, thrust is unsurprising. But why are the pasts of build, bend, lend, 
send, spend t-final??? Why doesn’t coalescence maintain, indeed foster, 
voicing, yielding [bild], [lend], etc., for both present and past? To put things 
differently, if d is legitimate in [p,..en bil _ #], why not in [pas bil _ #]? 

My proposal involves two conjectures. First, I submit that D p, differs 
in an important way from other d’s of English — crucially from the root-final 
d of \BILD ‘build’ — in being unspecified for voice. The d of the root, in other 


words, is [+ voice] whereas the d of Dp, is [? voice]. In the manner of a 


Past 
probe, it seeks valuation of its [voice] feature in the environment. Second, 


I submit that valuation of the [voice] feature of D,,,, can only be sought 


Past 
(though not always found as we will soon see) in the very specific place 


defined by the rule in (20). 


(20) CVCVCV 


aN 


[voice] — [a voice] / [a voice] (X) [Dpyy_ | 
The rule in (20) formulates the idea that D,,,, will have the same value for 
voice as the consonant linked to the central templatic position italicised and 
underscored there. The parenthesised expression represents the possible (but 
irrelevant) presence of a consonant attached to the same templatic position as 


D,,,, as in, for example, (21c,e). 


Past 
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(21) a. b. c. 
CVCVCV CVCVCV CVCVCV 


Ve \ ZT 


KIP+D COST+D 


Past Past 


d. e. 
CVCVCV CVCVCV 


Wy I //7 


BILD BILD+Dp,,, 
All the configurations reviewed up to this point appear in (21). The reader 


can verify that the value for voice of D in (21a,b,c) is the same as in the 


Past 
consonant linked to the italicised templatic position in (20).!° In (21d), I 
have represented the present of build for comparison with its past built (21e). 
The final root-coronal in (21d) is specified for voice and nothing more needs 
to be said. The fact that in (21e), D pyst 
feature in the adjacent radical d is expected, because the latter is not attached 
to the templatic position specified in (20). Rather, a sonorant is attached to 
past? Because of 
space limitations the outline of an answer only will be offered. Universally, 


does not find valuation of its voice 


that position. Why does a sonorant cause the devoicing of D 


sonorants appear to have no impact on the voice value of neighbouring 
segments. One interpretation of that fact is that sonorants are themselves 
Past 1 (21e) will not 
be able to value its own voice feature according to the rule in (20), and it will 


phonologically unspecified for voice. '* If this is correct, D 


simply default to [- voice] eventually devoicing the adjacent root consonant. 
Similar examples of post-sonorant default devoicing of D p, in WV are given 


in (22). 


Past 


(22) dealt, dreamt, felt, knelt, leant, meant, hurt, learnt, bent, lent, sent, 
spent, dwelt, spelt, smelt, spilt.'> 


Conclusion 


In this brief chapter, based on an analogy with the mode of deployment of 
root consonants in the class of Arabic segholate nouns, I have argued that 
the 60 or so IWV of English are much less irregular than might seem and 
that there is nothing elusive or mysterious in the form of their Past Tense 
morpheme. 
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Notes 


10. 


ii; 


12. 


13; 


14. 


15. 


I am grateful to Edoardo Cavirani, Radwa Fathi, Noam Faust, Andrew Nevins, Tobias Scheer, Yuko 
Yoshida, and Connor Youngberg for help and encouragement. All mistakes are my own. 

While all verbs will eventually be reviewed, space limitations leave me no choice but to assume 
familiarity with the evidence. Where lists are provided, I have not distinguished between dialects of 
English. 

For a radically different view, see Kaye (1995). 

In doing so, I follow a line of research wherein morphological devices usually associated with 
Semitic or Afroasiatic languages can be shown to play a major role in the morphology and phonology 
of languages outside that family. Early efforts in that vein include the demonstration in Ségéral 
(1995) and Ségéral and Scheer (1998) that the vocalisation of every single Standard German strong 
verb is amenable to the Ablaut system argued for in Guerssel and Lowenstamm (1990; 1996) for 
Classical Arabic; or, closer to the point of this chapter, the argumentation in Bendjaballah (2012; 
2014) whereby non-concatenative marking is shown to be similarly implemented in the verbal 
systems of genetically and typologically unrelated languages such as Taqbaylit Berber and Standard 
German. 

Following Lowenstamm (1996) or Scheer (2004), the template is represented as a sequence of light 
syllables. 

Cf. Faust (2019) for a related proposal. 

See Lowenstamm (1996) and Scheer (2004) for general discussion and Balogné Bérces (1998) for 
English specifically. 

For similar cases of templatic pressure causing vowel shortening in Chaha, see Lowenstamm (2000; 
2001). 

With two paths to verbhood, the ongoing historical change whereby IWV’s tend to shift to regular 
weak, can be construed without having recourse to analogy. 

The reader may wonder how crept or dreamt with their two initial consonants fit into a template with 
one single, initial C position. Perhaps, those clusters should be analysed as complex segments as 
argued in Lowenstamm (2003), [kept], [d'emt]. Alternatively, the template should perhaps include 
an extension, viz. [v C(C) V C V C V]. I am not really interested in pressing either point because all 
the phenomena to be discussed here take place elsewhere, that is, in the underscored portion of the 
template: [v C(C) VC VC V]. 

On Danish, see Larsen (1994). Hammond (1997) provides arguments specific to English in support of 
that view. He observes that because they sit in open syllables the lax stressed vowels in [heDi] Hattie 
or [mini] Minnie should normally undergo tensing. The fact that they do not, he notes, follows from 
his claim that the intervocalic consonant is in reality a geminate. 

The question mark records my scepticism about degemination. If degemination had any consequence, 
for instance if it fed or bled a process; or if it allowed for the statement of a non-trivial generalisation; 
or if it applied in context-specific fashion, I would accept any of that as solid evidence. As no such 
evidence has been produced (to the best of my knowledge), I would rather think of the geminates of 
English as intact albeit unpronounced. I will continue noting the pasts of read and rid as [redd] and 
[ridd]. 

The voice value of Dp, in (21c), it is claimed, is due to s, not to radical t. On the same view, the 
voiceless member of the f/v alternation in leave/left, bereave/bereft does not result from the presence 
of a final t in the Past Tense. The opposite is claimed: the voicelessness of Dp, results from the 
voicelessness of the f of left or bereft. 

See Scheer (2015a,b) for an in-depth discussion of the voice status of sonorants, including a critical 
review of the literature as well as the author’s own views. 

By the account I give of default devoicing of Dp,« in the presence of a sonorant, heard — contrary to 
fact — should be pronounced as hurt. 


Past 
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From me to [ju:]: on government 
licensing and light diphthongs 


Markus A. Pochtrager 


Introduction 


This short contribution discusses the representation of English [ju:] and, in 
a wider context, distributional asymmetries in (light) diphthongs. It picks up 
the line of research started in some early works in Government Phonology 
(GP) (Kaye et al. (KLV) 1985; 1990; Charette 1990; 1991), especially Kaye 
and Lowenstamm (1984) and Kaye (1985). It shows where the theoretical 
machinery commonly employed for light diphthongs is in need of further 
development. Potential for such improvement comes from two sources: On the 
one hand from Government Licensing (Charette 1990; 1991), and on the other 
from GP 2.0 (Péchtrager 2006b; 2018; 2020; 2021a,b; Kaye and Péchtrager 
2013; Zivanovié and Péchtrager 2010). Firstly, because GP 2.0 argues that 
structure continues below the traditional ‘segmental’ level — put differently, 
that we have not reached the right level of granularity yet — and light diphthongs 
show that we need to zoom in closer. Secondly, because it takes seriously the 
various asymmetries between elements, and that between I and U has a role 
to play in (light) diphthongs, too. Frustratingly, while it seems clear in which 
direction we must look, the precise answers are far from obvious. 

Let us begin with English. When looking at the distribution of English 
[j] one has to make a distinction between [j] preceded by a consonant 
(henceforth: Cj) and [j] not preceded by a consonant (henceforth: plain [j]). 
Two pieces of evidence point towards plain [j] sitting in an onset: It goes with 
a as the indefinite article (*an yard) and, since onset and nucleus show great 
independence of each other, plain [j] precedes practically any vowel: yes, 
yield, yip, yawn, yeast, yoke, yearn, yard, university, etc. 

This is quite different from Cj as in queue [kju:], and the problem has 
long been known (Chomsky and Halle 1968, 192 ff.; Polgardi 2015; Szigetvari 
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2016). Harris (1994, 62) takes Cj to be a branching onset in parallel to [bl] in 
blue. Thus, by the Binarity Theorem (Kaye et al. 1990) he correctly excludes 
*[blj], *[klj], *[trj], etc., since a constituent cannot dominate more than two 
positions. Cj can be dominated by a binary branching onset, while *[blj], 
etc., cannot. However, there is a downside: If Cj is indeed a branching onset, 
then why is the following vowel restricted to [u:] (or variants thereof, that is, 
reduction to schwa in unstressed position, lowering by r)? That is, why is there 
[kju:t] but no *[kjeit], *[kja:t], etc.? This dependency is unexpected if onset 
and nuclei are independent of one another, and suggests that [j] and [u:] share 
a constituent, forming a complex nucleus of the kind that Kaye (1985) dubbed 
light diphthong.! Light diphthongs are attached to a single skeletal slot, hence 
(metrically) ‘light’, with the general structure given in (la), but one of their 
component parts might also extend over two positions, as in the [ju:] in cute 
with a long second member (1b). This is the representation that also Polgardi 
(2015) assumes, though in the slightly different framework of Loose CV. 


(1) a. Structure ofa light diphthong b. English [ju:] in cute, etc. 


R O R 
| | 
N N 
| 
x xX XxX Xa... 
/\ | VV 
a p k j u 


Treating [ju:] in the Cj context? in this way correctly predicts the lack of 
phonotactic restrictions between [ju:] and the preceding onset, where almost 
all consonants are possible: [kju:t], [mju:t], [lju:d], [vju:], [hju:], etc. (Some 
English varieties bar coronals, though, which I cannot go into here, but see 
Note 12.) But by treating [ju:] as fully contained in the nucleus we lose another 
phonotactic generalisation: An onset preceding [ju:] cannot branch. Onset 
and nucleus are (relatively) independent of one other, yet *[krju:t], *[blju:] 
do not exist.> Note that other kinds of complex nuclei, for example, heavy 
diphthongs, are not so restricted (brown, clay). In other words, the [j] in Cj 
behaves as if it belonged to onset and nucleus simultaneously, as assumed by 
Giegerich (1992, 158), who links the [j] in a word like view to both constitu- 
ents. This captures the ambiguous constituency,‘ but fails to explain it: Why is 
[j] special and why in this particular way? In order to answer these questions 
we will have a quick look at light diphthongs in French in the next section, 
followed by an attempt to express the ambiguous constituency and then add on 
another complication in the fourth section of the chapter when looking at the 
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other glide of English, [w], as well as French (again) and Japanese. The final 
section of the chapter gives a summary and discusses open issues. 


A first look at light diphthongs 


Within GP, light diphthongs have been studied in detail for French (Kaye and 
Lowenstamm 1984), Vata (Kaye 1985), and Japanese (Yoshida 1996), among 
others. 

French has three glides that occur in light diphthongs:> front unrounded 
[j] as in miette [mjet] ‘crumb’, front rounded [q] as in muette [myet] ‘mute 
(fem.)’, and [w] as in mouette [mwet] ‘sea gull’. Kaye and Lowenstamm 
(1984) discuss various tests for their constituency, based on phonotactic 
restrictions to the left/right and the behaviour of determiners. For example, 
the definite articles le/la lose their vowel before vowel-initial words and thus 
provide clues about the nature of what follows. (2) illustrates that test for 
nouns beginning with [w] (examples from Kaye and Lowenstamm 1984, 136) 
and reveals a systematic ambiguity. 


(2) a. le watt [lo wa...] ‘the watt’ b. l’oie [Iwa...] ‘the goose’ 
le week-end [la wi...] ‘the weekend’ Toint — [lwé...] ‘the anointed one’ 
le western [lo we...] ‘the western’ 
le wombat [lə wõ...] ‘the wombat’ louest [lwe...] ‘the west’ 


The words in (2a) behave as if they began with a consonant, those in (2b) as 
if they began with a vowel. The contrast is particularly stark when the vowel 
following [w] is identical between the two sets, as in le watt/l oie. Accordingly, 
Kaye and Lowenstamm (1984, 136) argue that one has to distinguish between 
[w] in the onset and [w] as part of the nucleus, that is, a light diphthong: 


(3) a. Onset + Nucleus b. Light diphthong 


O R O R 
| | 
N N 
| | 
x x x 
| | A 
w a W a 


This not only explains why (3a) behaves as consonant-initial (because the 
onset has both a point and melody attached to it) while (3b) does not (because 
its onset is empty).° Furthermore, Kaye and Lowenstamm argue that (3b) 
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correctly predicts there to be restrictions on the nature of the vowel following 
[w], since [w] and the following vowel share a position. No such restrictions 
hold in (3a), where [w] and the following vowel occupy separate positions and 
constituents. Lastly, further support comes from words like trois [tswa] ‘3’, 
where the onset branches to its (binary) maximum, and still there is room for 
[w]. This must mean that [w] sits in the nucleus, that is, structure (3b), but with 
a branching onset instead. 


Government Licensing 


Applying this kind of reasoning to English we can posit the following repre- 
sentation for gueue, as has been done for example, by Polgardi (2015). 


(4) English queue with a light diphthong 


O R 
| 
N 
he 
x = X X 
|, oie 
k j w 


This explains why the preceding onset is quite unrestricted in quality (again, 
leaving the ban on coronals in some varieties aside) and also why there are 
restrictions on the following vowel, since the glide shares a position with (the 
first half of) it. However, (4) fails to explain why branching onsets cannot 
precede the nucleus. If we still want to stick to (4) we will need to explore in 
more detail under which conditions branching onsets and/or light diphthongs 
are possible at all. 

The first question, under what conditions branching onsets are licit, 
is (partially) addressed by Charette’s (1990; 1991) notion of Government 
Licensing (GL).’ GL expresses the idea that the head of an onset needs a 
licence to govern a non-head position, and this licence ultimately derives 
from the following nuclear head. (5a) illustrates this for a case where an 
onset governs a complement, as in tree, and (5b) for an onset governing a 
preceding rhymal position (winter). The two overlap in a word like pantry, 
where the ¢ governs both the preceding rhymal complement as well as the 
onset complement. 
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(5) a. GLin tree b. GL in winter 


O R O R O R 
l | 4 
PSs | WAIN 

x X X X x X X X X 

LLL Ep] 

t I E w tion t 9 


c. GL in pantry 
(0) R O 


R 
\ 
| 
x 
| 
p 1 


N 
x x x x x 
few 


The labels direct and indirect refer to the ‘flow’ of dependency, which moves 
consistently away from the nucleus in direct GL, but changes direction 
in indirect GL (and skips a position, at least at the skeletal level; though 
see below). The difference can be illustrated by what kind of GL a final 
empty nucleus can dispense. French allows both direct and indirect GL by 
a final empty nucleus, hence pacte [...kt] ‘pact’ (direct GL) as well as table 
[...bl] ‘table’ (indirect GL). Final empty nuclei in English, however, only 
allow direct GL: pact is fine, but in table we do not find a final branching onset 
bl, but rather a sequence of onsets with an intervening empty nucleus.’ At least 
in final position it seems that the possibility of indirect GL implies direct GL; 
a final empty nucleus appears less likely to confer indirect GL.° 

The same asymmetry comes back before [ju:]. We do not find branching 
onsets preceding, as mentioned before (*[krju:t], *[blju:]), but we do find 
coda-onset clusters: endure [indju:a], impute [mmpju:t], spew [spju:], skew 
[skju:], etc. (Incidentally, the last two examples support yet again Kaye’s 
(1992a) proposal that sC cannot be a branching onset but is fine as coda- 
onset.) The recurrence of this asymmetry suggests strongly, that here, too, GL 
is at play. In other words, just like a final empty nucleus in English, [ju:] can 
dispense direct, but not indirect GL (6a—b). 
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(6) a. Direct GL in impute b. (No) indirect GL *[trju:] 
OR O R *O 


| 
eee 
Pil AV 
P 


1 m jou 


R 
l 
» 
x X X X 
TA 


Iı j u 


c. Indirect GL in tray 
O R 


In contrast, heavy diphthongs as in tray, cloud, ploy (6c) do allow for direct 
GL. The question is, why is there such a difference in GL potential between 
the two types of diphthong? Given that the two structures differ in several 
aspects at the same time, isolating the responsible factor is tricky. (Several 
factors might also be interdependent.) Let us look the differences individually. 

The first obvious difference between a heavy and a light diphthong is 
that the head of the nucleus, the ultimate source of licensing, is associated to 
two separate phonological expressions in the case of a light diphthong, but a 
single one in a heavy diphthong, as contrasted in (7). 


(7) a. N of queue b. N of toy 
with elements with elements 
R R 
N N 
x x x x 
I U A I 
U 


By containing two phonological expressions (that is, two units with separate 
elements), the head is more complex in the light diphthong in (7a). Obviously, 
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complexity here cannot mean counting elements in the style of Harris (1990), 
since the head of the nucleus (the leftmost skeletal point) is associated to the 
same number, two, in both (7a) and (7b). We rather want to capitalise on 
the complex one-to-many association between skeleton and phonological 
expression in (7a). But while this usage of ‘complex’ might seem intuitive, 
from a theoretical point of view it remains unclear how exactly it is relevant 
for GL: Charette (1990; 1991) assumes that all nuclei project to the so-called 
licensor projection; see also Yoshida (Chapter 13). Governing onsets (and only 
those) are also present at that particular projection, and it is at this level that 
GL takes place. But on the licensor projection the elements and the configura- 
tions in which they attach to a skeletal slot, that is, the contrast between (7a) 
and (7b), should be irrelevant. Above the skeleton both are identical. 

The same objection can be raised against the second point of difference 
between (7a) and (7b), viz. the position of the weaker member of the 
diphthong. In a heavy diphthong this is the offglide, accurately captured 
in (7b), as the offglide is linked to the non-head position of the branching 
nucleus. In a light diphthong like [ju:], on the other hand, one might deem the 
onglide [j] less important, the ‘ultimate head’ is what follows. Thus, in order 
to get from that ‘ultimate head’ to the preceding onset one would have to cross 
the glide and this non-local relationship is why direct GL is precluded. But 
note that none of this can be read off the structure in (7a). Given Charette’s 
definition of where GL takes place and the fact that headship is defined at the 
skeletal level, the onglide and (at least the first half of) the following vowel 
are equidistant from the preceding onset, as they both associate to the leftmost 
skeletal position. !° 

Both points so far show that the theory is not fine-grained enough, or 
somewhat incomplete: It has the means to express a crucial distinction, but 
inadequately, since it does not allow us to derive the precise properties of the 
two types. The same holds true of the third difference between the structures 
in (7): The light diphthong in (7a) contains no A, while heavy diphthongs like 
[av], [ar], and [or] (7b) have A in the head position, and no A in non-head 
position — at least in English.!! Within GP 2.0 (Péchtrager 2006b; Kaye and 
Poéchtrager 2013), the element A has been under extensive scrutiny, since it 
differs in its behaviour from other elements. A seems to license more complex 
structures than otherwise possible, for example, Southern British English 
draft, with a long vowel (containing only A) followed by a coda-onset cluster 
(with coronal ź also assumed to contain A), cf. Péchtrager (2012; 2021b) for 
discussion of such super-heavy structures. The presence of A might also be 
(part of) the reason why heavy diphthongs (7b) can government license a 
preceding branching onset, while the light diphthong in (7a) cannot. (Not 
the sole reason, since monophthongal high vowels do allow preceding 
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branching onsets, for example, tree or true, so A is only one ingredient.) Note 
again that nothing in the original definition of GL allows us to derive this in 
an insightful way, and there is no deeper reason why A of all elements should 
interact with GL. The best we can do at this point is state it in prose. !? 

While it is not entirely clear how exactly those factors can be made 
relevant, it seems clear that at least some of them impact the potential of 
a nucleus to dispense (direct) GL. In fact, by assuming that a lack of GL 
is responsible for the ungrammaticality of *[klju:], etc., another parallel 
emerges: English prohibits branching onsets preceding [ju:] in the same way 
that it disallows them before final empty nuclei, that is, we also do not find 
final branching onsets.'? And as we saw before, coda-onset clusters are fine 
before either (imp[ju:]te, imp®). French however allows branching onsets 
both before (certain) light diphthongs (trois ‘3’) and before final empty nuclei 
(maitre ‘master’); so once again the GL abilities of light diphthongs and 
(final) empty nuclei match up.'* We will come back to this at the end of the 
concluding section of this chapter. 


| and its unlike brother U 


Once we broaden our view beyond [ju:], the picture becomes considerably 
more complex, as the quality of the onglide also plays a role. This asymmetry 
can already be shown in English, but will get even more important as we 
move on to other languages. But let us look at English first and contrast [j] 
to [w]. Despite both being glides, their distribution is quite different. Plain w 
(not preceded by a consonant) can be followed by almost any vowel (wheat, 
wit, wet, wait, walk, woe, etc., though *[wa:]) including [u:/e] (woo, woman). 
In Cw (consonant preceding) we notice distributional restrictions on the 
preceding consonant: coronal obstruents (twin, dwell, thwart) and velars are 
fine (quick, Gwen), but labials are not (*[pw], *[fw], etc.), and neither are 
sonorants. All this suggests that in words like twin, etc., we are dealing with 
branching onsets.!5 If Cw is fully contained in a branching onset, we do not 
expect systematic restrictions on the following vowel, which is indeed correct 
(twin, tweet, twice, twang, twat), only with the exception of *Cw+[u:/s/a/ 
av], which might or might not be an accidental gap.'° Furthermore, the ban 
on labials can then be construed as a general homorganicity restriction similar 
to why we do not find [tl/dl] as a branching onset, as is often assumed, for 
example, Harris (1994, 172). And finally, the lack of *[twr] or *[trw] falls 
out from the Binarity Theorem, imposing an upper limit of two members on 
constituents. Unlike [j], [w] seems to be a fine second member of a branching 
onset. !7 
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Now, this difference between [j] (containing the element I) and [w] 
(U) is unsurprising, given that asymmetries between those two elements have 
long been noted (Charette and Göksel 1994; 1996; Denwood 1997; Goh 1997; 
Péchtrager 2015; Mutlu 2017; Živanović and Péchtrager 2010). For example, 
in Turkish vowel harmony (Charette and Göksel 1994; 1996; Péchtrager 
2010), U spreads only into a subset of the positions that I spreads into. 
Péchtrager (2015) argues that similar I/U-asymmetries are found in English 
heavy diphthongs and the internal structure of Mandarin nuclei, among others. 

Such asymmetries in the behaviour of I/U can be found in various 
languages and all kinds of phonological structures, including glides in light 
diphthongs. Kaye and Lowenstamm (1984), cf. the second section of this 
chapter, discuss the varying constituency of French [y] and [w] (onset versus 
part of nucleus), but do not go into [j]. This is not accidental: While we find 
cases where [j] qualifies as the first member of a light diphthong after a single 
consonant (8a) and at least one initial case (8b), there do not appear to be cases 
with a preceding branching onset (8c). 


(8) a. bien [bjé] ‘well’, rien [wjé] ‘nothing’, mien [mjé] ‘mine’, vieux 
[vjo] ‘old’, pièce [pjes] ‘piece’, piaf [pjaf] ‘sparrow’, piaule [pjol] 
‘den’, etc. 
b. les yeux [lez jo] ‘the eyes’ 
*(bhé], *[ksjo], *[trja], etc. 


What sets apart [j] (I) from [y] (both I and U) and [w] (U) is the lack of U in 
[j]. For a light diphthong to license a branching onset, then, it seems that U 
is needed in the first member. In fact, this same requirement can be exploited 
to explain (part of) the English facts: [j] does not contain U, so *[plj-], etc., 
will be out. And since [w] never occurs as the first part of a light diphthong in 
English, the issue of having to or not being able to license a branching onset 
will simply not arise. Of course this does not yet explain why [w] cannot act 
as the first member of a light diphthong, but that is an independent problem. 
But in any case, we do not know yet why U should be needed to make GL 
possible. 

While we can identify a common restriction for the two languages, it is 
unclear to what extent generalisation beyond English and French is possible. 
Spanish makes clear that this condition might not be universal, at least if forms 
like griego ‘Greek’ are to be analysed as a branching onset [gr] followed by a 
light diphthong [je], which seems likely (Harris 1983). 18 

Let us finally turn to Japanese for another example of an I/U- 
asymmetry in glides. Japanese does not restrict C in a Cj-sequence and 
lacks branching onsets in general. Accordingly, Yoshida (1996, 151ff.) and 
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Kaye (1992b, 145ff.) take [j] as part of a light diphthong. (9) gives examples 
from Kaye (1992b, 148).!9 


(9) kyaku ‘visitor’, hyaku ‘100’, ryaku ‘omission’, zyama ‘hindrance’, 
myaku ‘pulse’, etc. 


Further support for a light diphthong analysis comes from the fact that front 
vowels are excluded after [j] (*[ji], *[je]). If [j] shares a position with another 
expression we do expect phonotactic restrictions. There is one important point 
of contrast to English, though. English plain [j] (see the introductory section of 
this chapter) behaves like an onset and shows no restrictions on the following 
vowel, while for Cj the following vowel is restricted. In Japanese, the restric- 
tions on the following vowel hold irrespectively of whether [j] is ‘plain’ 
or post-consonantal, suggesting that both cases involve a light diphthong 
(preceded by an empty onset in the case of Japanese plain [j]). 

As is already to be expected by now, the distribution changes when 
we look at the glide [w], which only precedes [a] in native Japanese words 
and never follows a consonant in the standard variety (*[wi], *[twa]). This 
suggests that [wa] is a light diphthong and unable to license even a non- 
branching onset. (Unless that non-branching onset is empty, of course.) In 
contrast to French, [w]-initial light diphthongs seem to be the more restricted 
type in Japanese. 


Summing up 


Table 12.1 summarises what we have seen for the three languages that we 
have looked at in some detail. 

When presented like this, three patterns can be gleaned from Table 12.1. 
Firstly, French (part (a) of the table) has the most liberal system, in the sense 
that light diphthongs show great freedom in what onsets they are preceded by: 


Table 12.1 Systematic comparison of permitted glide plus vowel sequences and 


structural conditions for (a) French, (b) Japanese, and (c) English. ‘—’ indicates 
systematic gaps that exist independently of light diphthongs. 

(a) French (b) Japanese (c) English 
Onset jV yV wv jV 4 wv jV yV wv 
Empty yes yes yes yes = yes a = 
Non-branching yes yes yes yes — E yes — 


Branching omo S yes = = — [=] = 
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empty, non-branching, or branching. The only restriction occurs with [j]-initial 
light diphthongs. Moving on to part (b), Japanese seems to allow a subset of 
the French system — on the condition that we ignore systematic gaps (such as 
the systematic lack of branching onsets or of the glide [y] in Japanese). That 
is, everything that is possible in Japanese is also possible in French. English 
(part (c)) can then be taken as a subset of Japanese, since all that is possible 
in English (which is very little, in fact) is also possible in Japanese or French, 
unless excluded for independent reasons. 

This neat pattern might well run into trouble once extended to even 
more languages. Spanish, for example, is even more liberal than French in that 
it allows branching onsets before light diphthongs ([je/we]): griego ‘Greek’, 
prueba ‘test’, etc. Yet at the same time it is not immediately clear that Spanish 
allows empty onsets preceding light diphthongs, since (at least) plain [j] is 
quite free in Spanish in what it is followed by and thus must be taken as an 
onset (rather than the first half of a light diphthong), that is, exactly what we 
saw for English in first section of the chapter. This would make it Jess liberal 
than French. 

Secondly, in all languages in Table 12.1 a light diphthong beginning 
with [j] bars branching onsets, or, put differently, an U is needed for GL 
to happen. (Again, this is not true for Spanish, which is more liberal.) The 
reasons for this special role of U in GL are mysterious. For Japanese such a 
restriction can be stated vacuously only, as there are no branching onsets to 
begin with. Note in this context that the beneficial properties of U in a light 
diphthong, whatever they are, do not hold for simple onsets: [j]-initial light 
diphthongs in Japanese allow preceding consonants (that is, non-branching 
onsets with melody), [w]-initial ones do not. 

Thirdly, English [ju:] as a light diphthong disallows both branching 
onsets and empty onsets. (Allowing empty onsets would predict that there 
are [ju:]-initial words behaving as vowel-initial; recall plain [j] from this 
chapter’s first section. This is an interesting conjunction of onset types, whose 
commonality lies in being a deviation from the simple, non-branching case. 
Van der Hulst (personal correspondence) surmises that a similar conjunction 
holds for empty nuclei and branching nuclei, both deviating (but in opposite 
directions) from simplex, non-empty nuclei. In fact, we saw a variant of this 
in the section ‘Government Licensing’, when we looked at how English and 
French final empty nuclei (that is, an empty nuclear head) patterned together 
with light diphthongs (nuclear head complex) in their GL abilities: Either both 
disallowed direct GL (English), or both allowed it (French). I say ‘variant’ for 
two reasons: (i) Light diphthongs are not necessarily branching nuclei (they 
can be, as in English), though they are of course complex. (ii) Final empty 
nuclei differ from internal empty nuclei, so they cannot be lumped together 
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as one class. Finally, let us notice that Spanish obscures the pattern consider- 
ably, in that its light diphthongs do not pattern with final nuclei with respect to 
GL at all. Spanish has both branching onsets (griego ‘Greek’) and coda-onset 
clusters (entiendo ‘I understand’) preceding light diphthongs, but neither can 
precede final empy nuclei. (Consonants preceding final nuclei are generally 
quite limited in Spanish; only (some) coronals and [x] occur.) 

While this little survey of light diphthongs raises more questions than it 
answers, it illustrates how in a seemingly innocent topic such as the analysis 
of light diphthongs a whole series of different questions comes together: the 
nature of I/U asymmetries, the internal structure of nuclei and what counts 
as a deviation from the ‘normal state’ of a constituent, and last not least the 
interaction with preceding onsets. At least for the last aspect we have the 
established tool of GL.7° 


Notes 


1. I take [jæ] in piano to be immediately preceded by an unstressed syllable (and not by a consonant); 
similarly in Vietnam, Myanmar. I assume that those sequences do not form a light diphthong. This 
assumption is not absolutely crucial, since there is no guarantee that English has only one diphthong 
beginning with [j]. (Southern British) English has more than one (heavy) diphthong beginning with 
[a] (mouse, mice) and more than one ending in [v] (crow, crowd). More light diphthongs might be 
developing, but we expect some restrictions between the two members to remain. However, the 
number of possibilities will pertain to the discussion of substantive conditions in the ‘Government 
Licensing’ section of this chapter. 

2. ‘[ju:] in the Cj context’ sounds clumsy, but I remind the reader that talking about [ju:] ‘in general’ 
would (incorrectly) lump together [u:] preceded by plain [j] and Cj. The ambiguous interpretation 
of [ju:] illustrates a structural ambiguity in phonology, examples of which can also be found 
elsewhere. (French p/ can be a branching onset whose members remain adjacent, as in tripler ‘to 
triple’/je triple ‘I triple’, or a sequence of two onsets which can get broken up, as in appeler ‘to 
call’/j’appelle ‘I call’.) I will stick to just ‘[ju:]’ in the following, urging the reader to keep this 
footnote in mind. 

3. This does not follow from Polgardi’s (2015) account, and I cannot see a simple way to implement it 
in her analysis. Polgardi does address the lack of branching onsets preceding [ju(:)/ja] in unstressed 
position, however, though by a different mechanism. 

4. In Giegerich’s proposal the single skeletal position that [j] is attached to is shared between onset 
and peak (his nucleus). This would be inexpressible in GP, where skeletal positions are never shared 
between constituents. Attaching [j] to two skeletal positions, one associated to each constituent, 
circumvents that problem but incorrectly predicts long [j:]. 

5. Scheer (Chapter 19) analyses word-internal [Cj] sequences in French cimetiére ‘cemetary’, 
cafetière ‘coffee maker’, hôtelier ‘hotel keeper’, etc., as two consecutive onsets, which behave 
like the (Strict CV equivalent of) branching onsets (except for when C is a lateral), instead of 
onset plus light diphthong. It is not clear that his arguments extend to the examples in the main 
text or those in (8) in the section ‘I and its unlike brother U’, all of which have word-initial [Cj]. 
There is no guarantee that constituency is constant across positions. Scheer bases his rejection of 
[j] as the first part of a light diphthong on the contrast between ca/f[a]tiére ‘coffee maker’ versus 
hôt[ə]lier ‘hotel keeper’: the former can lose schwa, the latter cannot, which is unexpected if 
[V] constitutes a light diphthong and thus a source for PG in both cases. This assumes that PG is 
the only factor responsible for keeping a vowel silent, but as Heo (1994) has argued for Korean, 
adjacent consonants might play a role, too. (Which Scheer’s own analysis capitalises on as well, but 
by different means.) 
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10. 


Ti; 


12, 


13; 
14. 
15. 


16. 


17. 


There is a third logical possibility that also allows for h aspiré as in le héros [lə evo] ‘the hero’, treated 
as a consonant despite the lack of an audible reflex (Charette 1991). In such a case there is an onset 
with a point (but without any melody), which is why it patterns with (3a). 

‘Partially’ because the basic substantive and formal conditions on branching onsets are given by 
other parts of the theory, such as the Binarity Theorem (Kaye et al. (KLV) 1990) and the Complexity 
Condition (Harris 1990), and/or earlier charm (Kaye et al. 1985; 1990). 

I take ‘syllabic consonants’, at least in English, to be just another name for onset plus onset with an 
intervening, non-p-licensed (that is, realised) empty nucleus, as I am not aware of any need to make 
a distinction. Blaho’s (2004) work on long and short syllabic consonants does not seem to contradict 
that view. 

While we will look at another instance of the same asymmetry in GL potential imminently, I will leave 
open whether the preference for direct over indirect GL can be generalised completely independently 
of position and for an entire language, as in Kaye and Lowenstamm (1981) or Cyran (2010). Thai 
(Smyth 2002) might be problematic for such a view since it has what seem to be bona fide branching 
onsets (indirect GL, with gaps similar to English/French, that is, no *¢/-), while lacking coda-onset 
clusters (direct GL), except for loans. 

Kaye (1985) assumes that the two melodic expressions associated to a single position are unordered. 
(This follows from the assumption that the skeleton is the only place where linear order is specified.) 
His proposal implies that the order in acoustic realisation must be determined by general principles 
of the input-output system. To my knowledge this has never been followed up or clarified. In fact, for 
English, an ordering is specified, though only indirectly so: The U of [ju:] attaches to two positions in 
the skeleton, with the second ordered after the first, hence I must precede in realisation. This does not 
change the fact that for GL the head of the nucleus (a skeletal position) is relevant. Since that nucleus 
is associated to the melody of the onglide and (part of) the following vowel, the problem in the main 
text remains unsolved. 

French and Japanese (see the fourth section of this chapter) do have light diphthongs containing A, 
and in French they also allow direct GL. There, other factors come into play for the GL potential, as 
we shall see. I will have to leave open whether there are ever heavy diphthongs without A, and, if 
yes, what their properties are; cf. Péchtrager (2006a) for a discussion of Finnish, Péchtrager (2015) 
for English and Mandarin. 

The element A, which interacts with structural properties, has been replaced by structure in GP 2.0. 
That version of the theory holds that in general the common understanding of phonological structure 
has not reached the right level of granularity yet. While this does not yet explain the connection 
to GL, it does at least allow us to understand better the internal complexity of various kinds of 
diphthongs (Péchtrager 2015). The particular structure chosen as a replacement of (old) A also has 
a bearing on [ja] as in accurate, one of the unstressed counterparts of [ju:] and similarly lacking in 
direct GL potential. In GP 2.0 [9] has the same structure (a head and a non-head) as the high vowels 
[i/1/u/v], thus explaining the parallels between [ja] and [ju:]. Those high vowels still differ from 
schwa, in that in the former one position is empty, while in the latter both positions are empty; this 
expresses aperture. For details, refer to Pochtrager (202 1a). 

The replacement of old A by structure also has consequences for coronals, since they, too, were 
assumed to contain A (at least in some varieties of the theory): Coronals will then contain more 
structure (despite their purported unmarkedness, on which cf. Péchtrager 2021b). This might explain 
why in some varieties of English coronals are barred from preceding [ju:] just as branching onsets 
are: Both are too complex. Evidence for the complexity of coronal stops comes from the various 
kinds of lenition they are subject to (Péchtrager 2016). 

For ‘syllabic consonants’ as in apple see Note 7. 

This does not yet explain why they match up. 

Two comments are in order: Firstly, if [kw/gw] were treated as labiovelars (rather than as branching 
onsets), a separate stipulation would be needed to exclude them as part of branching onsets, in coda 
position and finally. That [gw] only occurs in names is surprising but possibly accidental. Secondly, 
loans like reservoir can show [w] after a labial, but it seems more likely that that word is borrowed 
with a light diphthong [wa], rather than containing a branching onset [vw]; cf. Fudge (1987) for more 
examples. 

[swu:] does exist (swoon, swoop), supporting yet again Kaye’s (1992a) view that sC is never a 
branching onset; see also Chapter 10. 

I do not mean to imply that all cases of Cw are always branching onsets; velar+[w] might of course 
be a (mono-segmental) labiovelar in some languages. Note that treating [kw/gw] as a branching onset 
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in English will be problematic for Péchtrager’s (2019) requirement that branching onsets contain at 
least one coronal member (including r). 

18. In the section ‘Government Licensing’ and, in particular, Note 10, we looked at A as a potentially 
crucial factor in the GL abilities of light diphthongs. The two light diphthongs of Spanish, [je/we], 
both contain A in their second members; but again, so does French [je], and the two languages differ 
in the GL abilities of [je]. 

19. Both authors treat forms like [tfa] ‘tea’ as [t]+[ja], as commonly done for Japanese (Vance 1987; 
Labrune 2012; Péchtrager 202 1b). 

20. ... and for that we say thank [ju:], Monik. 
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Licensor tier and culminativity 
Yuko Yoshida 


Introduction 


A lexical word cannot have more than one primary stress or pitch accent (after 
Hyman 2006).! This chapter aims to provide a unified account for such culmi- 
nativity of a word domain, in relation to the licensors and licensed nuclear 
positions.? Licensed nuclear positions include an empty nucleus observed in 
vowel-zero alternation. Both stress/pitch accent and vowel-zero alternation 
are the manifestation of licensing operating at the licensor projection (Charette 
1990) proposed in relation to vowel-zero alternations (VZA), for example, in 
French. 

The headship of a domain culminates at the projection where all the 
licensors are present. That accounts for why the weak member of a branching 
constituent and empty nucleus receives no word accent. In Japanese, a nuclear 
position is obligatorily filled by an element, unlike in a language where VZA 
occurs at an empty nuclear position: there is silence when such a position is 
licensed, while an unlicensed empty nuclear position (in French) is interpreted 
variously as [œ] in non-final positions and [e] in final accented position. 
In Japanese, as in the case of Tangale (Charette 1990) a nucleus filled with 
element(s) loses phonetic interpretation at a licensed position. High Vowel 
Devoicing (HVD) is an instance of such non-interpretation of the licensed 
positions and does not in fact involve a deletion (structural change). 

Headedness of a melodic expression contributes to the bifurcating 
phonetic interpretation of the same element(s) (Yoshida 2019). A headed 
element is projected to the Licensor Tier (LT): for example, an element |I| on 
LT manifests phonetically as [i], while non-headed |I| is not projected to LT 
and would be devoiced. The |I| element on LT is phonetically interpreted as 
[i] and, when licensed, the same element may not be fully interpreted as [i]. 
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Accent location is determined at the tier of licensors, namely the LT, 
where the heads of various domains at all levels are projected. The VZA 
equivalent in Japanese, HVD, rarely occurs when the nuclear position bears 
the word accent, and unaccented high vowels, /i/ and /u/ are typically subject 
to devoicing (Vance 1987; Yoshida 2013). Stress facts lend support for the 
interaction of heads at LT, in particular, in languages such as English that 
show quantity sensitivity in stress assignment (Hayes 1995). In morpho- 
logically simplex English nouns, for example, heavy syllables attract word 
stress, thus the head member of a branching nucleus and a head nucleus of a 
branching rhyme preceding its complement are stressed: the head member is 
on LT and so visible to stress/accent assignment. 


Problems 


An accented position and its melodic content are intricately related in many 
languages and Japanese is no exception. In this chapter, I would like to focus 
on issues of pitch-accent in Japanese and stress in English, in particular related 
to quantity sensitivity. 

For a noun without lexical marking, the default position for pitch accent 
in Standard Japanese (and some other dialects) is the antepenultimate nucleus 
with full melodic content (Yoshida 1999). 


(1) a [mu rása ki] ‘purple’ 
b. [ho to tó gi sw] ‘mountain cuckoo’ 


In Standard Japanese, two high-vowels, namely /i/ and /u/, are subject to 
devoicing, which strongly relates to accent: accented high-vowels are unlikely 
to be devoiced, and in the assignment of accent, devoiced high-vowels are 
ignored: they are transparent (2a,b). A high vowel is subject to devoicing 
when flanked by voiceless consonants. Note that [6] represents a voiceless 
bilabial fricative, to be distinguished from /ø/, an empty nucleus, which is 
central to the issues in this chapter. 


(2) a. [naná ġw eigi] ‘seven wonders’ 
b. [boo é ki coo] ‘trading company’ 


In traditional analyses geminate consonants are counted as one ‘mora’ 
(Haraguchi 1991); however, that ‘mora’ behaves in a unique way in the 
assignment of pitch accent. The default accent location is antepenultimate 
in Standard Japanese, as noted above, and a geminate consonant is counted 
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when it appears within the last three ‘moras’ (3a), but that ‘mora’ itself is not 
accented (3b) (Yoshida 1999). 


(3) a. [iburri gak ko] ‘smoked pickles’ 
b. [ma tw bó k kur ri] ‘pine cone’ 


In the framework of Government Phonology (GP), we can simply say that the 
empty nuclear position that separates the two O slots in a geminate consonant 
takes part in the counting of nuclei, but is not accented (as an empty nucleus 
cannot take an accent, cf. Yoshida 1999). 

The second member of a vowel sequence, namely a long vowel or a 
diphthong, is also avoided in accent assignment. Within a branching nucleus 
(4a), the first position is the licensor of the two, thus the head of the constituent 
domain only is the candidate for the word-domain head. 


(4) a. [þwráwaa] ‘flower’ 
b. [rée daa] ‘radar’ 


What the three types of examples above reflect is that a nuclear position 
without full phonetic interpretation cannot be accented in the word domain. 
Such positions are licensed, and licensed nuclear positions cannot be projected 
to the Licensor Projection (LP) (Charette 1990). 


Theoretical development 


Licensing of a given domain should operate in a regular manner for a speaker 
to manage processing in a given utterance. Considering the generalisations 
above in the section ‘Problems’, the phonological processes observed are the 
instances of licensing at a projection where licensors of domains are projected 
(Yoshida 2019). Here, we will focus on how these accent facts are related to 
the LP proposed for vowel-zero alternation. 


Licensor Projection (LP) 


Charette (1990) proposed the LP as a means of shedding light on vowel zero 
alternation in French. This projection is of course responsible for other phono- 
logical phenomena, and the attempt in this chapter is to develop the theoretical 
implications of determining the head position of a phonological domain, 
to formalise previous work and to propose the LT. One of the manifesta- 
tions of licensing between two adjacent nuclei is Proper Government (PG), 
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which regulates vowel-zero alternations. PG results in non-interpretation of 
the licensed nuclear position, which Charette proposed as a development of 
PG discussed by Kaye (1987). 


(5) Proper Government (PG) 
a. The governor may not itself be governed. 
b. The domain of PG may not include a governing domain. 


To discuss PG (5), we first confirm what a governing domain refers to. 
Government is an instance of licensing (Charette 1990, 1991), and I develop 
the concept of a licensing tier within Strict CV (Lowenstamm 1996) in this 
chapter. Government of a licensing domain can be expanded to that of a phrase 
(Yoshida 1999) and can be found also within a constituent (Kaye et al. (KLV) 
1985, 1990) as shown in (6). This is shown with an arrow. 


iN 


(6) a © b. N c. 
= ->X e 


x x x 
The LP (Charette 1990) is a tier where all the licensors are projected. These 
are the head nucleus of an inter-nuclear licensing relation (6a), the head of a 
branching onset, which itself is an intra-constituent governing domain (6b), 
and the non-nuclear head that governs its preceding rhymal complement (6c). 
This is shown in (7). 
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(7) a. b. 


N N N Nuclear Projection 
| | 
N N N N O N Licensor Projection 
| | | 
ONONON O N O N 
Le ll A 
x X X KX XK X x X XXX 
Ll dd I lt 
s ® men ®@ s a kre 
semaine ‘week’ sacré ‘sacred’ 
c. 
N N N ON N Licensor Projection 
| | 
ONONONON OR OR OR OR 
| yl] 
N N N N 
ltd dl 
X xX xX X X X X &X x xX XX X X X X X 
Prt Tid LP UPE Er 
suv ®ni r® pasrvoaonisr:r @ 
souvenir ‘remember’ parvenir ‘arrive at’ 


This chapter extends the context of licensing from relations operating in a 
phonological domain of a CV pair and between 2 Vs of the consecutive CV 
pair to those at work in a word domain, which designates the head position of 
the word domain. Phonological domains often extend to those of phrases, that 
is, merged domains of different syntactic categories (Yoshida 1999); however, 
we limit our discussion to those of nouns in isolation for brevity. 


Accent assignment and licensed positions: towards a non-diacritic 
approach to the headedness of an element 


The focus here will first be on the accent location of the Japanese words 
examined in (2)-(4). The default accent location is antepenultimate, and some 
sets exhibit pre-antepenultimate accent (Yoshida 1999). The viewpoint in this 
chapter is that the pre-antepenultimate V position is the head of the entire 
word domain. V-slots that qualify as the head of the word domain have to be a 
licensor at the lower level and will be projected to the LT. 

Apparent diphthongs or monophthongs involve sequences of V 
positions. In Standard Japanese, like in many other varieties, the two V 
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positions contract licensing relations, namely the initial position of that 
sequence licenses the second one. In other words, we do not need to postulate 
an extra tier of skeletons (x-slots) and what effectively works is the LT. For 
the vowel sequences question, the initial V position is the head that will be 
projected to that tier. The second position within the apparent branching 
nucleus is a licensee of the initial position and thus not projected to the LT. 
The licensor of the word domain projects to what we now call the LT, and thus 
the second position never attains the status of the licensor of the word domain 
as it is licensed itself. Except for certain varieties of Japanese such as Kansai 
Japanese (Yoshida 2019), we hardly ever obtain a lexical item that places an 
accent on the second member of what can be perceived as a diphthong or a 
long monophthong, and they belong to separate CV pairs, not a branching 


constituent. 

(8) a. /ai/ b. /a:/ 
Licensor Tier (LT) V V 

CV Tier (CVT) C l G Vv GV CV 
Melodic Tier (MT) l | a 


This analysis of apparent monophthongs and diphthongs finds a support in 
the pitch pattern of Kansai dialect: the second member of the consecutive 
nuclear material can be accented. In this case the second V is not licensed 
and is projected to the LT, which can be the head of the domain, the accented 
position. 

In Japanese, devoiced high vowels, an intervening nuclear position 
within a geminate and the second member of the long vowels are licensed. 
Licensed positions are not projected to the LT. The first structure we 
investigate is a devoiced high vowel, which is more typical in Standard 
Japanese than in many western dialects, for example, the Osaka dialect. The 
difference manifests in the actual pronunciation as well: /u/ in the Osaka 
dialect is pronounced as lip-rounded [u], and in the Standard as [w]. The 
devoicing of the back vowel is far less frequent in Osaka Japanese than in 
Standard Japanese. That is due to the floating melodic content |U| in Standard 
Japanese (Yoshida 1999, 2019), and that does not allow the nucleus to project 
to the LT due to general principles of association and projection. Thus, the 
accent does not land on that nucleus. 
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(9) a. Osaka dialect [u] b. Standard Japanese [w] 


Licensor Tier (LT) V 
CV Tier (CVT) C OV C V 
Melodic Tier (MT) IU] IU] 


This line of analysis provides an explanation as to why some western dialects 
of Japanese locate accents on a so-called moraic nasal /N/, on which we 
do not normally find accents in Standard Japanese. The GP view is that 
the moraic nasal /N/ is a nasal element for the onset and an empty nucleus 
(Yoshida 1990) unlike other standard approaches, and in fact, /N/ involves 
the element |U| (Yoshida 1999, 2019).? In the Osaka dialect, an accent can 
land on the so-called moraic nasal /N/, which rarely happens in other eastern 
dialects including Standard Japanese, which is a hybrid variety based on the 
Tokyo dialect. As was proposed in Yoshida (2019), a headed expression of 
this element |U| projects itself to the LT. Though the representations of /N/ 
involve a nasal element associated to the onset positions (Yoshida 1991, 
1999), similar to the distinction of [u] (Osaka) and [ur] (Standard Japanese), 
the representations of /N/ (10) below differentiate Osaka /N/ with the headed 
|U] and Standard Japanese /N/ with a floating (non-headed) |U]. Only headed 
|U] can be projected to LT to serve as the prosodic head of the word domain 
(see Yoshida 2019 for more detailed discussion). 


(10) a. Osaka dialect /N/ b. Standard Japanese /N/ 
Licensor Tier (LT) IU] 

CV Tier (CVT) cC vV cy 
Melodic Tier (MT) n |U n |U] 


The headed |U| is projected to LT; however, non-headed |U| is not, which 
results in a /N/ which cannot bear an accent. 

Now we turn to geminates. A geminate ‘consonant’ counts in the count 
of Vs, but it cannot be the site of a pitch accent. This is another instance of a 
nuclear position not being projected to the licensor projection or licensor tier. 
This can be easily explained from a Strict CV point of view if we assume that 
a geminate consonant involves a consonant linked to two onset or Consonant 
positions sandwiching an empty V. For example, loan words, which also 
have accents on the antepenultimate V, will count the empty V between the 
geminate consonant. The two examples below are from Italian loans, ricotta 
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‘ricotta cheese’ and (Ponte) Vecchio ‘old (bridge)’. The former has a geminate 
consonant /tt/ and that part counts as one unit for accent assignment. Yoshida 
(1999) defined the empty V to be counted at the nuclear projection for accent 
assignment; however, we might want to focus on the fact that the onsets 
surrounding that empty nucleus form a licensing domain, and the licensor 
onset is projected to the LT. That’s exactly where the licensor of the word 
domain is selected, the antepenultimate licensor on that tier. When the ante- 
penultimate licensor coincides with the geminate licensor, the accent shifts 
towards the left as in /bekkio/ (11b). We refer also to the LP (Charette 1990) 
organised into the representation with LT. The LP includes the licensor of 
the geminate consonant: here I assume that a head-final geminate embracing 
an empty nucleus is governed by the following filled nucleus. C slots do not 
serve as an accent bearer, and this suggests that the determination of an accent 
bearing slot works on a different tier. The LT only accommodates the licensor 
nuclei, or those that license in various forms such as PG that hold between 
the nucleus enclosed between the geminate consonants and the nucleus that 
follows it. 


(11) Geminates 


a. /rikotta/ b. /békkio/ 
LT V V V V yVy— Vy 
Ar A a 

| 

CVT CVCVCVCV CVCVCVCV 
ct Se Ri alee 
MT c || k jA| t |A| b JAI k {I |A| 
|U] II| |U] 


We have demonstrated how empty nuclei interact in accent assignment in 
Japanese along with the necessity of the LT. 

The reason we count nuclei to assign the word accent, but we need 
to refer to a different tier, LT, is to actually choose the nucleus that can be 
the licensor of the domain, as the head of the entire prosodic domain. So 
the nuclear positions which are not projected to LT are not candidates for the 
domain head. 

The phenomenon of course is not a language particular matter and what 
is happening in other languages, for example in English, lends support to the 
presence of a LT. 
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Theoretical implication for English word stress 


English stress on ‘branching constituents’ 


LT along with the idea of properly representing empty nuclei can explain why 
branching constituents attract stress in English. A large class of English nouns 
are proparoxytonic when the penultimate syllables contain a non-branching 
rhyme. When the penultimate syllable contains a branching constituent, either 
branching nucleus or rhyme, English nouns are paroxytonic. (There are also 
nouns with lexical stress like Kentucky or medulla.) Most examples are from 
Halle and Vergnaud (1987). 


(12) Proparoxytonic — Antepenultimate Stress 
América cínema óctopus 


(13) Penultimate Stress when it has: 
i) a branching nucleus ii) a branching rhyme 
aróma agénda 
balaláika amalgam 


English word stress exhibits sensitivity to constituent structures as the above 
examples show, and stress locations prove to be highly predictable, by 
counting the ‘syllables’. The last syllable of English nouns is extrametrical 
(Hayes 1981; Halle and Vergnaud 1987), and thus the word-final rhyme is 
invisible to stress assignment. But that invisible rhyme is included in the 
count of nuclei to determine the third nucleus from the last. Here we’d like to 
focus on what we are actually counting. This comes down to identifying what 
a ‘syllable’ is, and we interpret this in terms of onsets and nuclei contracting 
licensing relations. In standard GP, as well as Strict CV theory, a nucleus is 
allowed to be empty, and both phonetically realised nuclei and empty nuclei 
are subject to counting. In the count of nuclei to determine stress/accent 
assignment, filled nuclei are equal to empty nuclei (Ulfsbjorninn 2014; Faust 
and Ulfsbjorninn 2018). 

Here are some representations to illustrate what the interpretations 
look like. We follow Strict CV theory in assuming that apparent branching 
constituents are in fact a sequence of two CV pairs. A position may be empty, 
analysed as a null vowel in Burzio’s terms (Burzio 1994). 
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(14) apparent penultimate stress 
a) apparent branching nucleus b) apparent branching rhyme 
CVCV, CV,C y CVCVCVCV 


PONG 
|A| r JAJ m |A] agenø da 


[U] 
aróma [əróvmə] agenda [adzéndo] 


Both (14a) and (14b) illustrate that counting the nuclei says we place a stress 
on the third V from the last, so the count of nuclei is crucial for the accent 
placement, and this is the case for the word class above, which does not need 
to be treated as exceptional from the antepenultimate stress. Antepenultimate 
accent in Latin is discussed in Scheer and Szigetvari (2005). (14b) shows that 
the penult is an empty nucleus, which is counted for the stress assignment. 
The empty position is licensed by the preceding nucleus which carries word 
stress, which is projected to LT. We also see tonic lengthening in (14a), as the 
following section explicates. 


Apparently short vowels are underlyingly long vowels 


Compare aróma [aróvmə] and aromatic [aromatik]. This stress found on 
aroma [aróvmə] tells us that the stress accented nucleus is projected to LT, 
and that nucleus now licenses an extra V that follows it, to form a diphthong. 
These vowels must occupy 2 independent CVs, namely, CV,CV,. (Different 
analyses are possible; see, for example, Balogné Bérces and Ulfsbjorninn’s 
proposal in Chapter 8.) Also note that loan words with final open syllables 
have a strong tendency of being paroxytonic: banána (from Wolof), basmati 
(from Hindi), plaza (from Spanish), mojito and barracuda (from American 
Spanish). Occurrences of antepenultimate stress are lower, as in cinema in 
(12). The word aroma itself is a loan from Greek via Latin into English. Loan 
word accentuation of this type marks the penultimate vowel as the head of the 
domain; thus the nucleus is projected on the LT, which means that the position 
should have a headed expression on the melodic tier. The position is a licensor 
and will license a CV which is inserted to its right (Scheer 2004). 

Strict CV theory assumes that the stress means an insertion of a CV 
(Scheer 2004), for example, Italian tonic lengthening, and when the nucleus is 
stressed, that nucleus licenses the inserted nuclear position. The stress is on the 
penultimate in words like aróma, and the stressed CV licenses a now inserted 
CV pair of the CV,CV,. Considering the LT, it finds the nucleus of CV, 
while licensed position CV, is not recognised. As for the melodic material, 
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the element from CV, spreads to its licensee CV,, as the manifestation 
of the licensing relationship, which is realised as the second member of a 
monophthong, or a diphthong, [v]. 

The phonetic interpretation of CV,CV, reduces to a single vowel such 
as [a] in [aramatik]. Those vowels won’t be realised as long when licensed by 
another nucleus, as in the case of aromatic [aramatik] where the stress location 
is determined by the morphology. The position is licensed; thus they are not 
headed and so the vowel is realised as reduced. 

The strong tendency of branching constituents to attract word stresses in 
fact comes from the nature of those constituents contracting licensing within 
themselves. The two classes distinguished above, namely branching nucleus 
and branching rhyme, can now be conflated into one category — that of a CV 
position in a licensing relation. Without the extrametricality of the rightmost 
constituent (Hayes 1980), we can readily conclude that the head of a sequence 
of consecutive Vs will attract the word accent. 


Antepenultimate stress and ‘lengthening’ 


Note that words with heavy ‘penultimate syllables’ place the accent on the 
fourth nuclear position from the last as seen in gladiolus [gladidulos]. One 
might still emphasise that the syllable generalisation is necessary. However, 
we are to refute this apparent advantage of a syllable in favour of CVCV, 
that is, the head part of the branching nucleus will take the role of the word 
accent. Recall the Japanese examples, which demonstrated how the licensor 
projection worked. At the same time, if a vowel sequence involving a long 
vowel or diphthong is found in the antepenultimate nuclear point, the one that 
licenses another position is chosen as the head nucleus of the word domain, 
the CV, of the CV ,CV,. The head of the word domain in English manifests as 
the stressed nucleus. 

When the antepenult falls on the licensed position of the branching 
constituent, namely a vowel position/nucleus, we predict correctly that the 
stress will fall on the nucleus/vowel to the left, as in gladiolus [gladidulas]. 
The number of Vs are counted, however, and if the antepenultimate V 
coincides with a licensed position, that is, the second member of a diphthong 
or a monophthong, the stress will be on the one to the left. The LT is where 
the headship of a word domain is determined, where the licensed members are 
not projected. 
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(15) a. diphthong [gladióvləs] b. monophthong [toma:tou] 
LT V V V V 


v 
| 

CVT ..C VC VCVCVCWV CV 
|| 

t o 


VCVvcvcy 
MT ..d HJA 1 |Als ø t 


| 

|A| 
| 

IU] |U] 


Consider the case of apparent penultimate stress, as in tomato or aróma as 
opposed to antepenultimate cinema. When a ‘penult stress’ is observed, the 
stressed vowel appears as a long vowel to form a licensing domain [a:] and 
[ow], resulting in the word placing the stress on the third nuclear point from 
the right edge. Compare aróma [aróvmə] and aromatic [aramatik]. A word 
like tomato provides divergent responses when placing stress on the word 
medial /a/. In British English, it is pronounced [toma:tow], the medial vowel is 
lengthened, and in American English, it is generally pronounced [toméitow]. 
In both cases, the stressed vowel is long: monophthong [a:] and diphthong [e1] 
both form progressive licensing domains. 

Remaining questions regarding the vowel length include the unstressed 
final nuclear sequence of tomato. GP assumes that the English domain-final 
nuclei are parametrically licensed, which means that a word can end with a 
consonant in English. Most vowel-final words in English are of foreign origin, 
which are incorporated into the above parametric setting. In a language whose 
domain-final nucleus is not licensed, for example, Japanese, the final nuclei 
must be filled, thus loanwords from English obtains an element |U| for that 
position as in hamu ‘ham’ (Yoshida 2003). In English, to obtain the licensed 
final-nucleus, the word-final filled vowel becomes a licensor, thus projecting 
to LT. The proposal given by Scheer (2004) and extended here predicts that 
we license an extra CV pair to the right of a final syllable. In words with a 
word-final diphthong, which has an |I| element or |U| element, for example, 
[tema:tou], manifests as the result of those elements spreading onto the Cs, 
leaving the following V empty and licensed by virtue of being domain-final. 
However, I also have in mind words that end in /9/ like [bəná:nə]. The same 
CV insertion applies to have a totally empty CV, with no melody spreading 
onto that C. It is necessary to examine this in more detail in the future. 


Conclusion 


This chapter pursued the possibility of the LT, a tier that controls stress/accent 
placement in phonological system, motivated by the LP (Charette 1990). 
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The argument is based on examples in Japanese and English. Further inves- 
tigation on stress and accent should focus on other languages as well as on 
larger prosodic domains in English. 


Notes 


1. I would like to express my huge gratitude towards Geoff Williams for discussions, invaluable 
comments and improving the manuscript. Remaining errors are mine alone. 

2. Some of the main abbreviations used in this chapter: LT = Licensor Tier; HVD = High Vowel 
Devoicing; LP = Licensor Projection; PG = Proper Government; NP = Nuclear Projection; MT = 
Melodic Tier; CT = Constituent Tier. 

3. Youngberg (2021) offers an alternative view where /N/ is either a nasal vowel in Standard Japanese or 
a syllabic nasal in Kansai Japanese, but this may hinder capturing some facts, for example, realisation 
of /N/ as the labial consonant [m] in emphatic pronunciation in both varieties. 
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Emptiness, schwa, and epenthesis: 
a brief introduction 


Connor Youngberg 


Introduction 


This part of the book focuses on the study of epenthetic vowels, vowels 
defined as schwa, and the study of empty constituents or positions in phono- 
logical structure. Broadly speaking, we can view epenthesis as a process that 
affects morphologically complex words or as a repair for loanwords that 
breaks up illicit, banned, or unfavourable consonant or vowel sequences by 
inserting a vowel or consonant respectively (Kenstowicz and Kisseberth 1979; 
Hall 2011). Schwa can be loosely defined as a central vowel like [9] or [ce] 
which alternates with zero in some languages. French is one such language, 
having a stable vowel [ce] as in [beef] ‘beef’ and an alternating vowel [ce] as in 
[foeve ~ fvø] ‘hair’ which is phonetically identical in word-medial context, but 
which exhibits divergent phonological behaviour (Anderson 1982; Charette 
1991).! This schwa is often transcribed [9] to differentiate its status, exempli- 
fying a different phonological identity to stable [ce] which never varies. 


Emptiness in phonology and Government Phonology 


Emptiness in phonology has a more complicated story and has been recognised 
to various degrees within the literature. The variability of certain vowels in 
phonological studies has led directly to the recognition of the potential of 
emptiness. To account for the alternating behaviour of what we identify as 
French schwa above, Anderson (1982) proposed that schwa is an empty nucleus 
which is realised as [ce]. Within Government Phonology (GP) (Kaye et al. 
(KLV) 1990; Charette 1991) schwa vowels, syncope, and epenthesis are in fact 
interrelated phenomena, and it is proposed that a) epenthetic vowels and schwa 


EMPTINESS, SCHWA, AND EPENTHESIS: A BRIEF INTRODUCTION 


155 


156 


vowels are instantiations of empty nuclei, whose existence and silence is heavily 
regulated, b) empty nuclei are found word medially at sites of vowel-zero alter- 
nations and within certain consonant clusters, and c) that word-final ‘codas’ are 
onsets preceding an empty nucleus (Kaye et al. 1990; Kaye 1990a,b; Charette 
1990, 1991; Harris 1994; Gussmann 2002). Empty nuclei remain silent or are 
realised as a vowel such as [ce] in French or [a] in English based on the medial or 
final position of the nucleus and the relevant conditions licensing their silence. 
Word-final empty nuclei are licensed by a parameter which is ON or OFF, being 
permitted in English /keet_/ [kæt] ‘cat’ but absent in Japanese and subject to 
epenthesis, e.g. [gasu] for English gas (petrol). 


(1) English and Japanese final nuclei 


a. English cat b. Japanese [gasu] 
ONON ONON 
| || | | of | 
k æt ga suwu 


In the word-medial context, empty nuclei are licensed to remain empty and 
silent by their following nuclear constituent. This does not come for free 
and is possible if and only if the following nucleus has a phonetic interpreta- 
tion or contains a segment (Kaye 1990a, b; Charette 1991). If this fails, the 
nucleus is realised phonetically. This licensing is called Proper Government 
(PG) within the GP literature. Kaye (1990a) shows that this regulates alterna- 
tions in the occurrence of Moroccan Arabic [i] in verbal forms of roots like 
/ktb/, such as [ktib] ‘he/she writes’ and [kitbu] ‘they write’ (Kaye 1990a). In 
short, Kaye claims that [#] is the realisation of an empty nucleus which is not 
properly governed and thus phonetically realised; epenthesis is explained not 
by reference to consonant clusters, but to the status of the following nucleus as 
shown below. PG is shown with a solid arrow, while the failure of government 
is shown with a crossed arrow. 


(2) Moroccan Arabic verbal alternations 


a. [kttb] “he/she writes’ b. [kitbu] ‘they write’ 
PG PG 
D T 
ONONON ONONON 
| | | | | | | 
k t H b k [H] t b u 


PG can also be said to regulate the appearance and disappearance of schwa 
(or alternating [œ]) in words like cheveux ‘hair’ [fævø~fvø] or in clitic groups 
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within French phrases such as envie de te le demander (Charette 1991, 199). 
This phrase may be maximally realised as [avi doe tæ læ doeemade], but the 
vowel [ce] in each morpheme within the clitic group may be syncopated in a 
strictly alternating pattern, giving possible grammatical realisations such as 
[avi d toe t dæmăde] or [àvi dæ t læ dmade] but never *[avi d t 1 doemade]. 
Charette (1991) explains that this can be accounted for through PG, which 
gives rise to the on-off nature of epenthesis. 

Epenthesis can be correlated to the occurrence and avoidance of 
certain consonant clusters. This is recognised early in Kenstowicz and 
Kisseberth’s (1979) study of Yowlumne or Yawelmani epenthesis and 
syncope. Investigation of the interlinked relation between consonant clusters 
and epenthesis has led to a wealth of proposals examining the exact reasons 
for the conspiracies outlined by Kenstowicz and Kisseberth. Let us briefly 
consider French. 

While above schwa in French is characterised as an alternating vowel, 
Charette (1991) points out that some vowels realised as [ce] never alternate with 
zero in specific cluster contexts, namely following a cluster of rising sonority 
or falling sonority word medially like brebis ‘sheep’ [bscebi], *[brbi] and 
parvenir ‘reach’ [pasvoenis] *[pasvnir]. The link between epenthesis or schwa 
and consonant clusters is considered by Charette (1990, 1991, 1992, 2003, 
2018) in languages such as French, Polish, Mongolian, and Tangale and the 
author proposes that the realisation (or lack thereof) of empty nuclei is directly 
linked to the existence of consonant clusters in the surrounding environment 
due to licensing requirements within phonological constituents. In a close 
examination of the relation between nuclei and their preceding onsets, Charette 
(1990, 1991) proposes that Government Licensing (GL) is a relation where a 
nucleus licenses not only the preceding onset, but also its ability to branch and 
form either a branching onset or coda-onset cluster. In short, a full nucleus 
can license the maximal syllable in a language, but languages will be forced 
to choose how empty nuclei are dealt with following a consonant cluster. In 
French, empty nuclei are typically unrealised in light syllable contexts like 
semaine ‘week’ [smen ~ scemen]. Word medially, schwa must be realised 
following a branching onset like brebis ‘sheep’ [bscebi], *[bgbi] and following 
a coda-onset like parvenir ‘reach’ [pasveenis] *[pasvnis]. This is because 
this empty nucleus must be realised and not silenced and licensed by PG so 
that it can government license its onset. However, Charette (1991) shows that 
conditions are not the same in word-final context in French words like quatre 
[kats] ‘four’, with different parameter settings regulating word-final and 
word-medial nuclei. Word-medial and word-final full and empty nuclei may 
thus be seen as having different potential. Charette (1991, 1992) also shows 
that the potential of empty nuclei to license clusters differs not only in different 


EMPTINESS, SCHWA, AND EPENTHESIS: A BRIEF INTRODUCTION 


157 


158 


varieties of French, but also cross-linguistically in Portuguese (Charette 1991, 
141), Polish, Mongolian (Charette 1992), Laurentian and Continental French 
(Charette 2003), and Acadian French (Charette 2018). Further work on the 
role of licensing of empty constituents, consonant clusters, as well as para- 
metrical variation and licensing potential in GP approaches includes, but is 
not limited to, work from Harris (1992, 1994, 1997), Lowenstamm (1996), 
Gussmann (2002, 2007), Kula (2002), Scheer (2004, 2012) and Cyran (2010). 
More recently, empty nuclei have been combined with Turbidity Theory by 
Cavirani and van Oostendorp (2017) and Cavirani (2022). 


Emptiness, schwa, and epenthesis in this volume 


The contributions in this part of the book contain experimental, descriptive, and 
theoretical works on epenthesis and vowel-zero alternations which consider 
not only representational and theoretical analyses, but also enlightening 
studies of epenthetic vowels and consonants focusing on empirical results. 
Many of the contributions are influenced by the Government Phonology 
analyses mentioned above, others focus on improving our understanding of 
epenthesis in specific languages with careful considerations of data. 

Some chapters overlap with topics discussed elsewhere in this book, 
but are placed here as they invoke emptiness and its effects within a 
word structure. For example, Kula (Chapter 18) discusses consonant cluster 
occurrence in Bantu languages, crucially invoking the power of nuclei to 
license clusters and silence an empty nuclear position in languages lacking 
tell-tale vowel-zero alternations showing PG activity. Likewise, Cavirani 
(Chapter 15) and Scheer (Chapter 19) discuss consonant-glide sequences in 
French, paralleling nicely with Péchtrager (Chapter 12), though Cavirani and 
Scheer place more emphasis on the link between empty or realised nuclei 
and consonant-glide sequences compared to Péchtrager’s focus on the profiles 
and composition of these sequences. 


A brief summary of the chapters 


Cavirani (Chapter 15) examines consonant-yod (Cj) sequences in French 
words like cimetiére, building on Charette (2003) using Optimality Theory 
style computation and Turbidity Theory to improve our understanding of 
epenthesis and gliding in Laurentian and Continental French. Turning to 
Brazilian Portuguese, Cristéfaro Silva (Chapter 16) considers the role of final 
empty nucleus in Brazilian Portuguese and implements empty nuclei to better 
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understand stress patterns. In an experimental study, Kenstowicz (Chapter 17) 
examines the epenthetic vowels of Connemara Irish, their realisation, and their 
syllabification. Such data crucially provides important empirical clarification 
on the status of epenthesis vowels in Connemara Irish. 

Turning to the question of consonant clusters and licensing, Kula 
(Chapter 18) examines Meinhof’s Law in Bantu languages from a GP point 
of view, arguing that restrictions and variations on NC cluster realisation are 
linked to licensing potentials of nuclei, with NC cluster alterations and lack 
of *NCIVNC2 arising as a result of the competing government licensing 
demands from nuclei to NC clusters, which must govern the empty nucleus 
within. 

Scheer (Chapter 19) examines the effects of consonant-glide sequences 
in French varieties, echoing Cavirani, but with a careful eye on what 
structures may be posited for Cj sequences, focusing on the governing and 
licensing possibilities and their implications. This part of the book concludes 
with a study from Szigetvari (Chapter 20), who neutrally examines plosive- 
zero alternation within consonant clusters in English in words like prince 
[prin(t)s]. He presents results from a new survey of English speakers and 
considers resultant problematic implications for analyses framed within GP. 


Note 


1. Schwa in French is much more complex than the broad and simplified picture given here. Its realisation 
and behaviour varies in the speech of different speakers, generations, speech registers, and French 
varieties. I refer the reader to Anderson (1982) and Charette (1991) for their discussion of the relevant 
data and Detey et al. (2016) for one recent overview of French variation. 
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Turbid government 
Edoardo Cavirani 


Introduction 


Government Phonology (henceforth GP) was developed around the 1980s, 
when the power of transformational rules was being drastically reduced both 
in phonology and in syntax.! The structural description of rules was gradually 
simplified, until it definitively disappeared. This resulted in the formulation of 
very basic rules that apply freely, minimally, and cyclically until the affected 
representation complies with the requirements of well-formedness constraints, 
which, in turn, gained an increasingly important role. 

In syntax, this tendency is witnessed by Government and Binding (GB; 
Chomsky 1981, 1982, 1986), where, for example, movement was formalised 
in terms of a very simple rule — Move a — that applies freely, its result being 
controlled by constraints such as barriers, Empty Category Principle (ECP), 
and, more recently, by the Minimalist Program (Chomsky 1995), where the 
free application of the one and only structure-building operation — Merge — is 
checked against interface constraints. 

In phonology, the anti-rule movement produced autosegmental 
phonology (Goldsmith 1976). Also in this case, well-formedness constraints 
on representations took a prominent role, as the application of a very simple 
rule associating the autosegments to the skeleton was triggered and controlled 
by constraints such as the Well-Formedness Condition, the Obligatory 
Contour Principle, and the No-Crossing-Line Convention. 

From this moment on, phonological research developed along two 
parallel lines. On the one hand, the rule has been totally dismissed, the 
whole computational burden being taken by a system of constraints, which 
can be absolute, as in Declarative Phonology (Scobbie et al. 1996), or 
violable, as in Optimality Theory (Prince and Smolensky 1993). On the 
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other hand, frameworks such as GP and its descendants (for example, strict 
CV; Lowenstamm 1996; Scheer 2004) shifted the focus on the representa- 
tional component, retaining a system of absolute constraints modelled on 
GB — c-command, Minimality Condition, Projection Principle, Structure 
Preservation, ECP, and Proper Government (PG) — and a sort of informally 
described association rule (Scheer and Kula 2018). 

In this chapter, I address two of the constraints that GP inherited from 
the 1980s — ECP and PG — as well as the computational component, namely 
the operation linking melodic elements to root nodes (for further discussions of 
ECP and PG see Cristéfaro Silva (Chapter 16) and Rennison (Chapter 6), for the 
No-Crossing-Line Convention). By analysing an alleged case of ECP violation, 
I aim at showing that such a violation is only apparent, and that PG can be 
retained in its original form, that is, absolute, exceptionless, and local. This 
is allowed by a redefinition of the association operation in terms of Turbidity 
Theory, an approach to input-output mapping that decomposes the autoseg- 
mental symmetrical relation between melodic elements and root nodes into two 
independent relations: projection and pronunciation. This makes it possible to 
distinguish between truly empty nuclei, which show no melodic content, and 
silent non-empty nuclei, which only lack the pronunciation relation (Cavirani 
and van Oostendorp 2017, 2019; Cavirani 2022). The empirical ground against 
which these hypotheses are tested is represented by sequences of two unpro- 
nounced nuclei in French (Charette 2003). Crucially, these sequences are argued 
to violate the ECP, which requires that each word-internal empty nucleus must 
be properly governed by a following non-empty nucleus. I argue that this 
ECP violation is only apparent, as the second of these two silent nuclei is not 
really empty: it is endowed with phonological content, which, though, is not 
pronounced. Crucially, by virtue of reaching a sufficient amount of representa- 
tional complexity, it is entitled to properly govern the preceding empty nucleus 
(for a different account of similar data see Scheer, Chapter 19). 

Besides eliminating apparent cases of ECP violation, the refinement of 
the empty category typology provides a solution to a set of interrelated formal 
issues of PG, such as the unclear relation between representational complexity 
and lateral strength, the dubious status of parametrically determined lateral 
strength, and the behaviour of yers, which, despite being pronounced, cannot 
govern a preceding empty nucleus, nor another yer. 


On some formal issues of PG 


One of GP’s most renowned trademarks is empty nuclei. Something similar 
was already around in the early 1980s (Anderson 1982; Spencer 1986), 
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but it is only in GP that such a device receives a thorough formalisa- 
tion (Scheer and Cyran 2018). This parallels what happened in syntax, 
where, although the debate on empty elements can be traced back to the 
outset of generative grammar (see Cavirani and van Oostendorp 2017 for 
references), empty categories become a pretty standard theoretical device 
with the development of GB. In GB, the distribution of these categories is 
taken care of by PG, which licenses the empty categories/traces left back 
by movement only if governed/bound by an antecedent. PG allows for 
the recovery of the semantic content of the trace, and the derivation of a 
well-formed syntactic representation. Inspired by the Structural Analogy 
hypothesis (Anderson 1985, 1992; den Dikken and van der Hulst 2020), 
Kaye et al. (1990) introduce PG in phonology. As in syntax, PG is meant 
to account for the distribution and empty categories, which in phonology 
usually translates in nuclei that are part of the phonological representation 
but receive no phonetic interpretation: empty nuclei (ENs). Also in this 
case, what is at stake is the well-formedness of representations containing 
empty categories: ENs need to be given the right to stay silent, namely, they 
need to be (p-)licensed. In GP, this is granted via PG, which is dispensed 
by a phonetically realised nucleus occurring in the following syllable. 
Something different needs to be said about final empty nuclei (FENs), 
as they are not followed by any audible nucleus. FENs are thus argued to 
be parametrically licensed. Besides FENs, two other special cases were 
recognised: ENs that are enclosed within an interconsonantal governing 
domain (for example, coda-onset sequences) or precede a s+C word-initial 
cluster (where they are magically licensed). The latter two cases have been 
addressed by strict CV, which gets rid of GP prosodic constituents and of 
the non-proper form of government, such as the one holding in an intercon- 
sonantal domain and the one involved in magic licensing. As a result, the 
theory gains in elegance and simplicity. 

However, as recently pointed out, for example, by Bafile (2020), strict 
CV seems to show some difficulties in accounting for patterns such as the one 
shown by Finale Emilia dialect, where segmentally similar consonant clusters 
are repaired via epenthesis in some forms but not in others. For instance, a 
form such as /‘sal@s@/ ‘willow’ results in ['sales], where an epenthetic [e] 
fills in the rightmost EN. The very same cluster, though, emerges as it is 
in /‘dolO@s@/ ‘sweet’: ['dols]. Here, the first EN stays silent, despite being 
followed by another EN. In strict CV, the difference between the two forms 
has been related to a representational difference between the leftmost ENs: 
whereas the one of the latter form is really empty, the alternating one is 
represented as a N with a floating melodic content (/e/), which gets associated 
to N, thus phonetically interpreted, only if not properly governed. As discussed 
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by Bafile (2020), an important ingredient of such an analysis is a stipulation 
that says that ‘final empty nuclei can only govern nuclei that do not possess 
any floating melody in the lexicon’ (Scheer 2004, 644). Thus, it seems that it 
is necessary to distinguish between truly empty and empty-cum-floater N, or, 
as suggested by Bafile (2020), retreat from the bold EN-proliferating positions 
of strict CV towards the safer standard GP shores, where the difference 
between [‘sales] and [‘dols] is due to the fact that only the former displays an 
EN, whereas in the latter, lateral and sibilant represent a coda-onset sequence. 
In her view, the problem is that ‘the formalism of strict CV theory and its 
conception of empty nuclei is not able to properly characterise the distinction 
between ‘true’ empty nuclei and alternating empty nuclei’ (Bafile 2020, 103). 

In line with what Bafile (2020) observed, I argue that it is necessary to 
develop a more adequate formalisation of the various types of ENs. Differently 
from her, though, I do not take position with respect to whether GP offers more 
adequate tools than strict CV. Rather, I propose a refinement of the represen- 
tational technology that is compatible with both theories and improves on a 
few interrelated drawbacks, such as a) the arbitrary and diacritic-like character 
of the parametrised government strength of FENs, b) the inconsistent relation 
between representational complexity and government strength, and c) the 
visibility of empty-cum-floater N, aka yers. 

The problem in c) should already be quite evident: floaters are by 
definition unassociated melodic primes, and it is thus unclear why a given 
floater should be considered private property of a specific root node before 
some phonological operation (linking/spreading) introduces the relevant 
association relation. At the level of underlying representation, namely prior to 
phonological computation, ENs are just ENs, and a following EN shouldn’t 
be able to discriminate whether there is some melodic prime floating around 
or not. On the other hand, if there is a floater, and that floater is aligned with 
a specific N, then it would be hard to conceptualise the latter differently from 
any other filled N. 

This brings us to b), namely the inconsistent relation between the repre- 
sentational complexity of N and its governing strength. In GP, government 
strength is traditionally assumed to be proportional to representational 
complexity, as governees cannot be more complex than governors, where 
complexity is a function of the number of elements making up the segment 
(Harris 1990). Building on this, Cyran (2008, 2010) develops a licensing 
strength scale, where a three-way distinction is made between full vowels, 
schwas, and ENs. This scale is related to the distribution of onsets (T), 
complex onset (TR), and coda-onset (RT) sequences. For instance, he 
observes that full vowels can be preceded by (that is, they can license) T, RT, 
and TR clusters, schwa by T and RT clusters, and FENs only by (a subset of) 
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T. As claimed by Cyran (2008), ‘the status of schwa in this scale is symbolic, 
and simply means that it represents a “prosodically weaker context than a 
full vowel”. A precise definition of schwa for the purpose of this scale is a 
matter of further research’. I refer the reader to Cavirani and van Oostendorp 
(2017, 2020) for the development of a theory that attributes the different 
behaviour of full vowels and schwas to differences in terms of the elements’ 
internal complexity. Details aside, it is important in this context to stress that 
the scale just referred to rests on the representational difference between the 
relevant segments, irrespective of whether they are pronounced or not. This, 
together with the assumption that empty-cum-floater Ns are representationally 
more complex that ENs, would predict that the former can properly govern the 
latter, no matter if they are not pronounced. 

Note, though, that the pervasive direct relation between representa- 
tional complexity and governing strength breaks down in word-final position, 
as rather than from the former, the latter is traditionally argued to follow 
from the setting of an ad hoc parameter. Some languages have this parameter 
turned on, so their FENs can properly govern. Others have it off, so their 
FENs cannot govern a preceding EN. This is a potential problem, as such 
a parameter looks like a diacritic, namely as something that is not encoded 
in the module-specific phonological vocabulary. In this sense, it looks like 
a betrayal of the autosegmental mantra inspiring GP-related frameworks, 
according to which, given the right representation, a process would follow. 
This encourages translating such a diacritic in more appropriate phono- 
logical, representational terms. This would allow us to tie the governing 
strength of FENs to their representational complexity, and the variation in 
FEN government strength across and within languages to representational 
variation, namely to the lexicon. 

This brings us to a crucial aspect of the hypothesis defended so far. 
I argued that (i) it is necessary to provide a formalism that enables us to 
distinguish between various kinds of EN; (ii) empty-cum-floater Ns are 
suspicious objects, and (iii) we should possibly not give up the direct relation 
between representational complexity and government strength, which would 
in turn allow us to get rid of FEN parameters. As I will discuss below, the 
key to bringing all these pieces together is a formal and explicit distinction 
between phonetic and phonological emptiness. The necessary formal tools 
enabling such distinction are provided by Turbidity Theory (TT). In what 
follows, TT will be introduced, and applied to a case of so-called pseudo- 
emptiness in French (the section below on ‘Pseudo-emptiness and turbid 
government in Canadian French’). As will be shown, this development also 
allows for a more adequate formalisation of empty-cum-floater N, and a 
solution to their alignment and visibility problem. 
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A turbid solution 


Turbidity Theory (Goldrick 2001; van Oostendorp 2008; de Castro-Arrazola 
et al. 2015) is an optimality theoretic approach to faithfulness based on 
containment, which assumes an input-output relationship whereby the former 
is contained in the latter. The containment assumption of TT provides the 
tools to formally express the difference between phonetic and phonological 
emptiness mentioned above. These tools are the two different relations 
linking melodic content to root nodes that derive from splitting the symmetric 
autosegmental relation traditionally assumed by GP-based models. Rather 
than assuming that if an element a is associated to a root node x, x is 
associated to a, TT assumes two asymmetric relations: a projection relation 
expressing the lexical affiliation between melodic primes and root nodes, and 
a pronunciation relation expressing the fact that a set of melodic primes is 
phonetically interpreted in a specific root node. Graphically, these relations 
are represented by two arrows: one from the root node to the melodic prime 
for the projection relation, and one from the melodic prime to the root node 
for the pronunciation relation. This is illustrated in (1), below, where repre- 
sentations are given that show no melodic content and no relation (*,), only 
the projection relation (x,), or both the projection and pronunciation relation 
(x,). The former represents an EN, the latter a full N, and the middle one an 
empty-cum-floater N (henceforth eN). 


Lod 
IA] IAI 
© @ [a] 


In TT, projection relations are part ofthe lexical representation ofa morpheme, 
and cannot be altered because of containment and Consistency of Exponence 
(van Oostendorp 2008). On the other hand, the pronunciation relations can 
be modified in the input-output mapping process. This results from the 
pressure exerted by structural constraints holding on surface representations. 
I argue that the ECP and the related PG can be conceived of as structural 
constraints evaluating the well-formedness of a surface phonological repre- 
sentation. Assuming a direct relation between representational complexity 
and government strength, PG would behave like a constraint favouring surface 
representations where a nucleus lacking the pronunciation relation is followed 
by one which is representationally no less complex.? For this to work, TT 
relations must be included in the calculation of representational complexity, 
together with the number of elements. 
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In (1), complexity would decrease from right to left. The rightmost 
nucleus is the most complex, and is endowed with a full government potential. 
The middle nucleus is slightly less complex, but it contains phonological 
material, so it can exert PG. The leftmost nucleus is phonologically empty, 
and cannot govern. 

Phonetically, the melodic content of the full N is faithfully interpreted, 
as expected. The phonetic interpretation of the other two Ns depends on their 
environment. If followed by a proper governor, they can stay silent, otherwise 
they are interpreted according to their phonological content: ENs are assigned 
a default vowel (usually the less marked vowel of the language), while eNs 
have their melodic content interpreted. 

This approach provides a solution to the issues mentioned in the 
previous section. The problems of the visibility and the alignment of 
floaters in eNs vanishes, as there is no floater to start with. What was 
considered a floater, is now a melodic prime projected by a root node. As 
such, it is integrated in the phonological representation, and it can thus 
be seen by the following N, which can discharge its government power. 
If the latter is missing, the melodic content of an eN is given the chance 
to be heard: it gets its pronunciation relations and becomes audible. As 
for the parametrically determined government strength of FENs, this can 
now be transparently related to representational complexity. This implies a 
distinction between FEN and FeN: the former is phonologically empty, and 
cannot properly govern a preceding EN/eN, whereas the latter is endowed 
with some phonological content, and can properly govern a less complex N 
occurring on its left. 

Another issue that can be solved by such an approach is represented 
by apparent sequences of ENs, namely cases in which an EN seems to 
properly govern another EN. This possibility is in principle excluded by the 
ECP. However, French seems to show such unexpected sequences, which 
leads Charette (2003) to loosen the otherwise absolute binarity and locality 
principles of GP. In what follows, I provide a TT-based solution that allows 
for keeping such principles (a different analysis of these data can be found in 
Scheer, Chapter 19 of this book). 


Pseudo-emptiness and turbid government 
in Canadian French 


In her account of vowel-glide alternation in French, Haworth (1994) introduces 


pseudo-empty categories, namely positions that share their melodic content 
with an adjacent position. This is shown in (2) with the representation of 
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confié [k6f je] ‘confided’. Due to OCP, the |I| element of the penultimate N 
spreads to the following onset empty position, whereas the N itself, being 
properly governed by the word-final N, is silent. By virtue of having some 
lexically defined melodic content, pseudo-empty categories resemble our eN, 
whose melodic content manages to escape government and gets interpreted in 
the adjacent consonantal position. 


(2) PG 
OROR OR 
| | 
N | N N 
x x x x x x 
| | | #7 | 
k 6 f {lj e 


Note that, as the spreading of |I| depends on the availability of a following 
onset, in the Lexicon, the pseudo-empty N in (2) seems to rather correspond 
to a full N: before the concatenation of the past participle suffix é, |I| belongs 
exclusively to the stem final N. It is not shared with any following onset, as 
there is none. It is only after suffixation that the configuration is created by 
which (i) PG can take place, resulting in |I| delinking, and (ii) the OCP is 
violated, thereby triggering |I| spreading. Crucially, given that the pseudo- 
empty N can be silenced by PG, it must be different from a full N. One could 
thus represent it as a yer, but, because of the problems of yers discussed in 
the preceding section, it is possibly preferrable representing the pseudo- 
empty N in (2) as eN (hence, in what follows I will refer to pseudo-empty 
Nas eN). 

As shown in (3a), in the Lexicon, whereas all the other segments 
have both the projection and the pronunciation relation, the final N of the 
root kõfi lacks the pronunciation one. The pronunciation of its melodic 
content is negotiated once this form is fed to the phonological module. If 
it is not followed by a proper governor, it gets its pronunciation relation 
and is phonetically interpreted.* On the other hand, as shown in (3b), if it is 
followed by a proper governor, it must keep silent. In such a case, if there is an 
adjacent empty position, its melodic content can spread to that position. This 
is formally expressed by the introduction of a pronunciation relation linking 
the element to the relevant root node. 
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PG 
(3) a. b. VAN 


O ROR OR OR OR 
rt itu 
N N N N N 

x X X X xX X xX XlX 

tt ¢ 4 riy i 

k 6 f l k 6 f [| e 

[ko f ja 


Charette (2003) moves from Haworth (1994) and considers cases in which 
a properly governed eN is preceded by an EN, namely cases that show 
two silent nuclei in a row, violating ECP. She compares Continental and 
Quebec French, and identifies two structures, which differ with respect to the 
consonant surrounding the EN. If the EN is preceded by a ‘potential onset 
governor’ (that is, the first segment of a complex onset) and followed by a 
‘potential onset governee’ (that is, the second segment of a complex onset), 
the two consonants do not enter in a (onset-to-onset) governing relationship. 
In this case, the EN cannot be properly governed and needs to be phoneti- 
cally interpreted. See, for example, atelier [atal'je], where the EN occurring 
between ¢ and / cannot be properly governed by the eN, which is itself properly 
governed by the following full N and surfaces as a glide. Conversely, if the 
EN is preceded by a ‘potential onset governee’ and followed by a ‘potential 
onset governor’, the two consonants either do not enter into a governing 
relationship and the EN behaves as in the previous case, as in Quebec French 
cimetière [simat jer], or they do enter in a governing relationship. In this case, 
the EN is properly governed, as in Continental French cimetière [simt'jer]. 
Thus, the presence of a governing relationship between the two consonants 
flanking an EN seems to be a condition for ENs to be properly governed by 
a following eN. Another condition should hold, though, namely the presence 
of an external proper governor. This is because, ‘if one were to argue that 
onset-to-onset government alone was responsible for the p-licensing of the 
[relevant EN], it would still remain a mystery why this condition plays no 
role in other contexts’, namely why i) ‘a p-licensed [EN] is never a proper 
governor regardless of the consonants surrounding the empty governee’, 
and ii) ‘an unlicensed nucleus can always be a proper governor whether or not 
the empty nucleus is contained within an onset-to-onset governing domain’ 
(Charette 2003, 476). 

The problem with this hypothesis is deciding whether the relevant EN — 
for example, the one occurring between m and ¢ in (4) — is properly governed 
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by the following eN (which is in turn properly governed and surfaces as a 
glide), or by the full N further to the right — for example, (4)’s /e/. 


4) O ROR OR OR OR 


| | | | | 
N | N]|NJ]N{N 


FL id] 
Li oh 


x 
| 
m t lj € r 


x 


x 


One possibility is that ‘the proper government of a [eN] does not exhaust 
the governing potential of its proper governor [and that the] latter can 
therefore [...] properly govern two nuclei’ (Charette 2003, 466). Alternatively, 
one could argue that ‘a p-licensed [eN] act as a proper governor and assure the 
p-licensing of [a EN]’ (Charette 2003, 474). In her paper, she starts exploring 
the latter option, but she eventually settles on the former. 

Both options come with their own problems. Assuming that a full 
N properly governs the eN as well as the preceding EN ‘implies relaxing 
the condition on strict adjacency between proper governor and governee’ 
(Charette 2003, 476) and the binarity principle according to which a proper 
governor can only govern one empty position. On the other hand, ‘claiming 
that a p-licensed position can act as a proper governor forces us to relax the 
conditions on what constitutes a proper governor [...] How could therefore any 
other type of p-licensed position be different? The condition is on p-licensing 
in general, not on the type of p-licensed category’ (Charette 2003, 476). 
Crucially, I argue that p-licensed positions can in fact differ from each other 
in terms of complexity, and that it is the latter that determines what governs 
what. As claimed throughout the chapter, this grounds on the possibility of 
distinguishing between two asymmetric root-melody relations, which can 
enter the calculation of complexity. Once this is taken into consideration, 
Continental French cimetière [simt'jer] can be represented as in (5). 


(5) PG PG 


O ROR OR OR OR 
rf oa tudor. 
N | N | NIN N 
x X X X X X X X XK X 
ttt ify ii 
s im t |l € 
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In (5), the full N /e/ properly governs the preceding eN. Accordingly, the 
latter is assigned no pronunciation relation and keeps silent, whereas its 
melodic content spreads to — that is, is pronounced on — the following 
empty onset. Crucially, as the eN is representationally more complex than 
the preceding EN, it can discard its PG power on the latter, which gets no 
phonetic interpretation. Note that, as in the original analysis of Charette 
(2003), the presence of an external eN proper governor is not enough for 
the silencing of the relevant EN, as it would be impossible teasing apart the 
behaviour of forms where EN is properly governed (for example, cimetiére 
[simt'jer]) and that of forms where PG does not apply (atelier [atal'je]). Thus, 
a further condition referring to the possibility of the consonants flanking the 
relevant EN to create an interconsonantal governing domain is necessary 
in the analysis proposed in this chapter too. I leave the refinement of the 
interaction between this condition and the lateral actorship of eNs to future 
work, and I refer the reader to Scheer (Chapter 19) for an analysis building 
solely on interconsonantal government. 


Conclusion 


In this contribution, I moved from a proposal put forward by Charette (2003), 
who suggests that, in order to account for some cases of sequences of silent 
Ns, we need to (i) relax the condition on strict adjacency between proper 
governor and governee, and (ii) give up on the binarity principle that allows a 
proper governor to govern only one empty position, and I suggested that such 
a departure from standard GP assumptions can be avoided by exploiting the 
representational possibilities provided by Turbidity Theory. More specifi- 
cally, I proposed that the formal distinction between ‘really’ empty nuclei 
(ENs) and silent nuclei endowed with some phonological content (eNs) 
granted by Turbidity Theory allows for maintaining a tight correlation 
between representational complexity and lateral actorship. In particular, 
I maintained that, given that eNs are more complex than ENs, the former can 
properly govern the latter even if it is itself properly governed. Provided we 
accept the extra condition concerning the governing domain established by 
the two consonants flanking the EN preceding an eN, this crucially allows 
for accounting for sequences of silent nuclei without departing from strict 
adjacency and binarity. 
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Notes 


1. This work is supported by the FWO Marie Skłodowska-Curie Actions — Seal of Excellence Postdoctoral 
Research Grant Nr. 12Z7520N. I would also like to thank Jean Lowenstamm, Radwa Fathi, and Tobias 
Scheer for useful comments. 

2. See Polgárdi (1999), Harris and Gussmann (2002), Cavirani (2015) for attempts at translating GP 
mechanisms into constraints, and Brandão de Carvalho (2021) for a discussion of this possibility. 

3. The (lack of) pronunciation of FeN can be taken care of by a constraint ranking where a constraint 
favouring CV sequences, for example, NoCopa, dominates a constraint penalising unparsed melodic 
content, for example, PARSE or PRONOUNCE. Due to space limitations, it is not possible to discuss this 
point further, but see Harris and Gussmann (2002) for a similar approach. 

4. This happens, for example, in the present tense, where the element gets faithfully interpreted. As 
an example, see Isg confie, pronounced as [k6'fi]. In such a case, one might assume an EN/eN 1sG 
marker. As this is not more complex than the root-final eN, the latter cannot be properly governed, and 
it gets phonetically interpreted. 

5. Following Charette (2003), I assume that the glide corresponds to the phonetic interpretation of an eN 
silenced under PG, despite the fact that the paper provides no evidence supporting the hypothesis that 
the relevant piece of melodic content is doubly linked. 

6. Charette (2003, 475) raises a further question, namely whether a properly governed pseudo-empty N 
can govern license the head of an onset-to-onset governing domain. Her answer is positive, which we 
can take as representing further support for the lateral actorship of eN. 
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Word-final onsets: a Brazilian 
Portuguese case study 


Thais Cristofaro Silva 


Introduction 


This chapter considers word-final onsets in Brazilian Portuguese (BP). It 
intends to support the view that word-final consonants are syllabified as 
onsets followed by an empty nucleus (Kaye 1990, Charette 1991, Harris 1994, 
Lowenstamm 1996, Piggott 1999, Harris and Gussmann 2002).!' In order 
to understand how word-final consonants are syllabified in BP we have to 
consider how post-tonic zero-vowel alternation currently takes place in BP. 
Both BP and European Portuguese (EP) traditionally had a restricted set of 
word-final consonants: /S, R, N, 1/. Consider (1). 


(1) Word-final consonants in Portuguese 


Orthography EP BP R Gloss 
/S/ mês [mef] [mes] *[mefi] or *[mesi] ‘month’ 
/R/ mar [mar] [mah] *[marł] or *[mahi] ‘sea’ 
/V sal [sat] [saw] *[sali] or *[sawi] ‘salt’ 
/N/ tom [to] [tò] *[tõ] or *[tõòi] ‘tone’ 


The first column in (1) lists the consonant that occurs word-finally. The second 
column lists the orthography for each word. The third and fourth columns list 
the data from EP and BP respectively. The fifth column indicates that in these 
examples a word-final vowel does not occur, be it in EP or in BP. The last 
column presents the gloss. Notice that if a final vowel were to occur it would 
be different in EP and BP. EP has [#] and BP has [i] as vowels that manifest 
when an empty nucleus is spelled out. Notice also that /S/ and /R/ have specific 
surface forms in EP and BP.” The lateral is vocalised in BP and /N/ triggers 
vowel nasality in both varieties. Our discussion is restricted to /S/ and /R/. 
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In order to account for the word-final consonants in Portuguese, it has 
been proposed that word-final nuclei are parametrically licensed (Miguel 
1993). Some languages, like English or French license word-final nuclei 
and a number of consonants occur word-finally (Kaye 1990; Charette 1991). 
However, the restricted number of word-final consonants illustrated in (1), 
may posit a problem to Portuguese, as some other word-final consonants may 
appear word-finally where zero-vowel alternation is observed (Miguel 1993, 
Assis 2017). Consider (2). 


(2) Word-final zero-vowel alternation in Portuguese 


Orthography EP BP Gloss 
/s/_ doce [dost] ~ [dos] ['dosi] ~ ['dos] ‘sweet’ 
/f/ lanche Daf ~ [14] [ lafi] ~ [1a] ‘snack’ 
/t/ pare [ pat] ~['pac] ['paci] ~[' pac] ‘stop’ 
MV vale [val] ~ [val]  ['vali]~[‘val] ‘valley’ 
/n/ cone [ kõni] ~ ['kõn] [‘k6ni] ~['k6n] ‘cone’ 


The examples in (2) illustrate cases of word-final zero-vowel alternation 
in EP and BP, where words end in sibilants, taps, laterals, or nasals. Other 
consonants may also occur word-finally in EP and BP.* What is important 
to observe is that in (2) a word-final vowel may occur, whereas in (1) a 
word-final vowel is not allowed. 

It appears that EP and BP accept any consonant in word-final position 
when a zero-vowel alternation occurs. Word-final consonants in Portuguese, 
like in other languages, seem to have appeared as a consequence of unstressed 
vowels weakening and loss (Kager 1989, 2007). Whereas vowel weakening 
and loss has been documented in EP for a long time (Mateus 1974; Mateus and 
Andrade 2002), BP is at a stage where vowel weakening, reduction, and loss 
has only been recently reported (Leite 2006; Dias and Seara 2013; Albano and 
Meneses 2015; Assis 2017; Freitas 2019). 

A question we posited is how to interpret the facts presented in (1) 
and (2), which present word-final onsets. Do both cases reflect the same 
phenomenon, or do they express different phonological interpretation? In 
order to address this question, we will take into consideration the notion of 
Proper Government and the Empty Category (Kaye 1990, Kaye et al. (KLV) 
1990; Charette 1991; Harris 1994; Lowenstamm 1996; Harris and Gussmann 
2002) to analyse zero-vowel alternation in Portuguese and explain the facts 
illustrated in (1) and (2). In the next section of this chapter we consider empty 
nuclei in EP. After that, we consider empty nuclei in BP where we propose 
an account for zero-vowel alternation and also explain the different sets of 
word-final onsets. 
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Empty nuclei in European Portuguese 


A number of works have considered zero-vowel alternation in EP (Barbosa 
1965; Mateus 1974; Andrade 1996; Mateus and Andrade 2002). Miguel 
(1993, 2003a,b) specifically adopted the Government Phonology framework. 
In order to account for word-final consonants that cannot be followed by a 
vowel, as shown in (1), Miguel (1993) assumed that word-final nuclei are 
parametrically licensed, and thus not phonetically manifested. 

Miguel’s (1993) work also explains the zero-vowel alternation in 
pretonic position in EP by assuming that Proper Government operates 
from right-to-left. Her analysis accounts for the similar pronunciations 
of the verbs superar ‘to overcome’ and soprar ‘to blow’ in EP, as shown 
in (4). 


(4) Proper Government in pretonic position in EP 


(a) O N N O 
| | | | 

x xX x x 
| | | | 
s u Ø r 


T= XxX = oO 


Both words in (4) are pronounced as [su'prar] in EP. In (4a) we have the word 
superar where the primarily stressed nucleus filled with the vowel [a] properly 
governs the nuclear position to its left, which then does not have phonetic 
content. In (4b) the soprar presents a branching onset. Notice that in both 
diagrams the word-final nuclei are not phonetically manifested as they are 
parametrically licensed. However, EP also presents zero-vowel alternation in 
post-tonic position. Consider (5). 


A E3 
ON NON 
fe | | | 
x x x X xX 
| | | | 
3 Ora 


5 — xX — O 


Miguel (1993) proposed that zero-vowel alternation in post-tonic medial 
position is accounted for by Proper Government which, in this case, operates 
from left-to-right as shown in (5). A problem her analysis faces concerns the 
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direction of Proper Government, which is assumed to be language specific 
rather than variable (Kaye et al. 1990; Charette 1991). Whereas Miguel’s 
(1993, 2003a) analysis proposes that in pretonic position Proper Government 
operates from right-to-left, in post-tonic position it operates from left-to- 
right. Miguel’s analysis faces further problems in cases where a word-final 
consonant occurs (cf. 1, 2). Consider (6). 


(6) 
y 
O R O N 
N 
| 
x X XX X 
T | | 
Il a NÍ Ø 


The representation in (6) illustrates the word /as [laf] ‘wool (pl.)’ and lanche 
[laf] ‘lunch’ in EP. Notice that, the word /as belongs to examples in (1) where 
the vowel is not manifested, whereas the word lanche belongs to examples 
in (2) where zero-vowel alternation occurs. According to Miguel’s (1993, 
2003) analysis the empty nucleus is parametrically licensed word-finally, thus 
not explaining the difference between lãs and lanche as shown in (6). Her 
analysis faces two problems: the variable direction of Proper Government and 
word-final onsets (cf. 1, 2). In the next section we consider empty nuclei in 
BP with the aim of understanding how word-final onsets can be explained in 
cases similar to (1) and (2). 


Empty nuclei in Brazilian Portuguese 
Word-final onsets in BP have been traditionally assumed to be restricted to the 
same consonants as in EP: /S,R,1,N/. Examples in (1) illustrated cases where 


the word-final onset in preceded by a primarily stressed vowel. Consider (7). 


(7) Word-final consonants in Portuguese 


Orthography Gloss 
a. /S/ lápis [lapis] ~ [laps] ‘pencil’ 
b. /R/ açúcar [a'sukah] ~ [a'suka] ‘sugar’ 
c. A fóssil [ fosil] ~  [fosiw] ‘fossil’ 
d. /N/ viagem [viaz] ~  [vi'azi] ‘tone’ 
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Examples in (7) show that word-final onsets filled with any of the consonants 
/S,R,1,N/ may be preceded by an unstressed vowel. Cases like in (7a) tend to 
have the post-tonic vowel deleted (Cantoni 2009; Soares 2016). Cases similar 
to (7b) tend to have the final onset deleted (Oliveira 1997; Huback 2006). In 
(7c) either a lateral or a posterior glide may occur. Cases as in (7d) tend to be 
subject to nasal diphthong reduction (Schwindt and De Bona 2017). We will 
focus on cases like (7a) as the discussion of cases (7b—d) would take us away 
from the purpose of this chapter. Consider (8). 


(8) Word-final consonants followed by a sibilant 


Orthography Gloss 
a. cheques [ fekis] ~ [eks] ‘cheques’ 
b. aves [avis] ~ [avs] ‘birds’ 
c. clubes [klubis] ~ ['klubs] ‘clubs’ 
d. jegues ['3egis] ~ ['3egs] ‘donkeys’ 


Examples in (8) are plural forms. They represent a recurrent pattern in BP 
where a post-tonic high front vowel may be deleted when followed by a 
word-final sibilant (Cantoni 2009; Soares 2016). Data in (8) are accounted for 
by a sequence of two adjacent nuclei: cheques /{ek@s@/. Notice that assimila- 
tion does not take place between the consonants, as they are not adjacent since 
an empty nucleus intervenes between them. The word-final empty nucleus is 
accounted for as being parametrically licensed. Regarding the medial nucleus 
we suggest that it is governed by the preceding stressed vowel defining a 
metrical governing domain. We will develop this proposal in the following 
pages, after considering other properties of post-tonic non-final vowels in BP. 
Consider (9). 


(9) Series of empty nuclei* 


Orthography Final vowel Final Consonant Gloss 
a. hóspede ['ospidzi] ~ ['ospdz] ‘guest’ 
b. hipótese [i potfizi] ~ [i'potz] ‘hypothesis’ 
c. hélice [elisi] ~ [els] ‘propeller’ 


Examples in (9) present words where two high front vowels occur in 
post-tonic position. The high front vowels may be deleted, so that a sequence 
of consonants occur. These words are nouns which can have a plural form 
where a final sibilant is added as a plural marker, leading to a sequence of 
three post-tonic empty nuclei: hóspedes /‘ospOd30sO/ ‘guests?’ (cf. 9a). In 
order to understand sequences of two and three post-tonic empty nuclei we 
have to consider penultimate stressed words. Consider (10). 
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(10) Antepenultimate stressed words 


Orthography Gloss 

a. xícara [fikara] ~ [fikra] ‘cup’ 

b. córrego [‘kohegu] ~ ['kohgu] ‘stream’ 
básico [ baziku] [ bazku] ‘basic’ 


Data in (10) show that the vowel which follows the primarily stressed vowel in 
antepenultimate stressed words can be deleted (Amaral 1999; Ribeiro 2007). 
In order to account for the zero-vowel alternation shown in (10) we will 
consider Segundo’s (1994) proposal for antepenultimate stress assignment in 
BP. Consider (11). 


(11) Antepenultimate stress template in BP 


§ w 
Lo] 
i j ji 
xX —>x x 

The diagram presented in (11) illustrates the metrical template for antepe- 
nultimate stressed words in BP (Segundo 1994, 157). The primarily stressed 
nucleus (N,) metrically governs locally (N,), and also governs (N,). The 
binary foot is formed by the metrical domain which projects (N,) and 
(N,), as only the projected nuclei are accessible to the metrical structure. 
According to Segundo (1994, 160) ‘metrical structure is not aware of the 
existence of two nuclear positions under the primary stress node when it is 
constructed’. We suggest that her proposal is in accordance with Kager’s 
(1989, 2007) proposal for the interaction of the metrical domain and syllabi- 
fication. Thus, one expects that syllable structure and metrical structure will 
be related. 

Segundo (1994) argues that as a governee (N,) should not branch, and 
this is the case in BP. Words such as */irismo or *colheita are not allowed 
in BP, as the former would have a post-tonic branching rhyme and the 
latter would have a post-tonic heavy diphthong. However, words as lirismo 
‘lyrism’ and colheita ‘crop’ occur, where penultimate stress is observed. 
She also observes that as a governor (N,) presents a greater number of 
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different vowels than (N,) and also that there is a relationship between 
vowel quality in (N,) and (N,). 

Notice that a primarily stressed nucleus in antepenultimate stressed 
words may have a branching rhyme, as in fósforo ‘phosphorus’ or circulo 
‘circle’, but it may not have a branching nucleus: *géologo or *cdotico. This 
is because, primary stress in BP must be assigned to the final, penultimate 
or antepenultimate vowel, but not to the fourth-to-last vowel. Thus, gedlogo 
‘geologist’ and caótico ‘chaotic’ are good words in BP.° The reason why 
branching nuclei are not allowed in primarily stressed positions in antepenul- 
timate stressed words is that such a position governs the nuclear positions to 
its right, as shown in (11). 

The proposal we have presented accounts for the intermediate nuclear 
position in antepenultimate stressed words not being manifested, as in hóspede 
‘guest’ (cf. 9) and xicara ‘cup’ (cf. 10), as well as plural forms with three 
post-tonic empty nuclei as in hóspedes ‘guests’, as shown in (12), since any 
post-tonic medial position may be metrically governed. 


(12) o 


= 


N N N N 
| 
x XXXXXXXX 
| | | | | | 
o s p @d OO s O 


We still have to account for zero-vowel alternation in word-final position in 
BP (cf. 2). Consider (13). 


(13) Word-final zero-vowel alternation in Brazilian Portuguese 


Orthography Gloss 
/pl jipe ['3ipi] ~ [3ip] ‘jeep’ 
/g/ sangue ['sagi] ~ [sag] ‘blood’ 
If! bife ['bifi] ~ [bif] ‘beef 
/v/ chave [Javi] ~ [fav] ‘key’ 
[f peixe [‘peifi] ~ [pef] ‘fish’ 
/3/ hoje [ozi] ~ [03] ‘today’ 
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Data in (13) illustrate cases where a high front vowel is deleted word-finally. 
Assis (2017) considered cases where /s,f,z,3,f,v,p,b,t,d,k,g,m,n/ occurred 
word finally in BP, where a word-final vowel occurred or not. We have to 
explain why a vowel may or may not occur in cases like (13) but a vowel is 
not allowed to occur after some word-final consonants, as illustrated in (1). 
We suggest that any post-tonic nuclear position filled with a high front vowel 
may be metrically governed by a preceding governor nucleus, so that it is 
not phonetically manifested. This proposal is similar to Segundo’s (1994). 
Consider (14). 


(14) Licensed and metrically governed word-final empty nuclei 


(a) (b) 


ON ON ON ON 
on | bel de 

x XXX x XXX 
| | | | | | ot | 
pas Ø pas Ø 


The diagram in (14a) illustrates the representation of the word paz ‘peace’, 
which presents a word-final empty nucleus that is parametrically licensed, 
and for the purpose of illustration is underlined in (14a). The diagram in 
(14b) illustrates the representation of the word passe ‘entry ticket’ where 
the nuclear position filled with the primarily stressed vowel in (14b) 
metrically governs the word-final nuclear position, which may or may not 
be manifested. 

The proposal we presented accounts also for a sequence of two and 
three post-tonic nuclei (cf. 12). A word-final empty nucleus may be either 
parametrically licensed or it may be metrically governed by the nuclear 
position which bears primary stress. Any post-tonic medial nuclear position 
(cf. 9, 10) may also be metrically governed. If the post-tonic position is not 
metrically governed, then the vowel is phonetically manifested. The metrical 
foot is formed by the primarily stressed vowel which is projected to the 
metrical structure and the word-final nucleus. The metrical governing domain 
we proposed, which is based on the interaction of the metrical domain and 
syllabification, offers a unified account of word-final onsets in BP and also 
accounts for zero-vowel alternation in post-tonic position, be it medial or final 
(Segundo 1994; Kager 1989, 2007). 

We posited the question whether cases in (1) and (2) reflect the 
same phenomenon, or they express different phonological interpretation. 
Our proposal suggests that cases where a vowel is not manifested (cf. 1) 
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are accounted for by the parametrical licensing of the word-final nucleus. 
Cases which display zero-vowel alternation in post-tonic positions are 
accounted for by metrical governing relations where an empty nucleus may 
or may not be phonetically manifested. Thus, cases like those in (1) and (2) 
are phonologically interpreted in different ways. This proposal offers a better 
analysis than the one proposed by Miguel (1993, 2003a), which assumed that 
cases in (1) and (2) had similar phonological interpretations, although they 
characterise different phenomena. 

A number of issues may be raised as this system of vowel-zero 
alternation develops. Some of the parametrically licensed word-final 
empty nuclei in Portuguese undergo segmental change of the final 
consonant as the plural is added: mar [mah]x[maris] ‘sea(s)’ and més 
[mes]x[mezis] ‘month(s)’.’ Notice that word-final consonants that may 
present a post-tonic vowel do not display similar segmental changes: 
torre(s) [toh]x['tohs] ‘tower(s)’. It will also be interesting to observe 
how zero-vowel alternation develops in pretonic position in BP and see 
whether a phonetically motivated process of vowel weakening and loss may 
become a productive phonological phenomenon (Napoleão de Souza 2014; 
Nascimento 2016). 


Conclusion 


This chapter considered word-final onsets in Brazilian Portuguese (BP) 
offering evidence for word-final empty nuclei. It also considered how 
post-tonic empty nuclei in BP are phonologically represented. Word-final 
consonants in BP have different representations in cases where a word-final 
nucleus must not be spelled out (cf. 1), and cases where zero-vowel 
alternation occurs (cf. 2). We proposed that a metrical governing domain 
operates from the primarily stressed vowel to the post-tonic nuclear 
positions to its right. Thus, post-tonic zero-vowel alternation in BP is 
accounted for by metrical governing relations that are established at the 
level of lexical representation. A word-final post-tonic nucleus that is 
parametrically licensed is not phonetically manifested (cf. 1). Otherwise, 
zero-vowel alternation takes place under metrical governing relations 
(cf. 2). Our proposal explains why in BP some word final empty nuclei will 
not be spelled out (cf. 1) whereas others will display zero-vowel alternation 
(cf. 2). It appears that BP is at a developmental stage where it formerly 
preferred word-final filled nuclei whereas now it tends to favour word-final 
empty nuclei. 
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Notes 


1. I would like to thank Yuko Yoshida, Connor Youngberg, and Florian Breit for their dedication to 
organising this volume. I would also like to thank CNPq (PQ-311934/2020-0) and FAPEMIG 
(PPM-X-00702-18). 

2. BP may also present an alveopalatal sibilant in some dialects, as Rio de Janeiro, for words like ‘més’. 
It may also present a tap in some dialects, as São Paulo, for words like ‘mar’. The examples were 
transcribed as in the variety of Belo Horizonte (state of Minas Gerais). 

3. There are restrictions as to whether a vowel may or may not occur depending on whether a branching 
onset or a heterosyllabic cluster precedes the nucleus which may not be manifested. This issue is not 
relevant to the present discussion. Please refer to Miguel 1993. 

4. In words like hóspede and hipótese the second-to-last vowel may be pronounced with a high vowel, as 
shown in (9) or with a mid-vowel, as in ['ospedg3i] and [i'potfezi]. We adopted the high vowel as it is 
relevant to the current discussion. 

5. The only exception is the word ndutico ‘nautical’. 

6. Assis (2017) did not consider the rhotics /R,c/ which alternate in verbal morphology, palatals /A,p/ 
which tend to be vocalised, and the lateral /I/ which is vocalised word-finally. 

7. The lateral undergoes more specifically changes: sal [saw]x[sajs] ‘salt (pl.)’. To discuss these cases 
would take us away from the purpose of the current chapter. The nasal /N/ does not display any 
segmental changes as the nasal vowel just receives the plural morpheme: /soNS/ [sds] ‘sounds’. 
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A note on the svarabhakti vowels 
in Connemara Irish 


Michael J. Kenstowicz 


Introduction 


The svarabhakti (epenthetic vowels) in Scottish Gaelic (SG) were first 
described in detail in a series of studies by Carl Borgstrom (1937, 1940, 
1941) based on fieldwork in the 1930s with speakers from the island of 
Barra and later from Skye and Ross-shire.! Borgstrom postulated a contrast 
in the syllabic affiliation of segmentally identical morpheme-internal VCV 
sequences such as marbh [ma.rav] ‘dead’ versus aran [ar.an] ‘bread’. As 
indicated by the orthography, the CV.CVC structures such as marbh contained 
an etymologically inserted copy vowel while the CVC.VC structures such as 
aran contained an ‘organic’ vowel that displayed the more reduced range of 
vowel contrasts that were possible in unstressed syllables. Borgstrom reported 
that the epenthetic vowel of structures like marbh [ma.rav] displayed a stress 
prominence equivalent to the first vowel while organic CVC.VC structures 
like aran [ar.an] had a strong-weak falling contour. Given that SG has 
regular word-initial accent, the equal stress would support the idea that V1 
and V2 are both constituents of the first syllable in the svarabhakti CV.CVC 
structures; and by bearing stress they would protect V2 from the reduction of 
phonological contrasts found in unstressed short syllables. Borgstrom also 
indicated that native speakers judge the CV.CVC structures as monosyllabic, 
providing additional evidence for the contrast with disyllabic organic stems. 
The svarabhakti contrast was subsequently documented by other researchers 
for more Scottish dialects (Oftedal 1956). 

Kenstowicz and Kisseberth (1979) called attention to this unusual 
contrast in light of the (re)introduction of formal models of syllabification into 
generative grammar. In general, when grammatical junctures are controlled 
for, languages have predictable syllable parsing. The SG data stood as a 
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significant challenge to this thesis. This is especially remarkable because the 
postulated CVCC inputs to the epenthesis are not in general manifested in any 
of the svarabhakti stems’ range of surface alternants. Over the intervening forty 
years the SG svarabhakti contrast has been the subject of numerous studies 
that have largely corroborated Borgstrom’s original observations and taken 
up the challenge they pose for models of prosody. These studies approach the 
issue from a variety of theoretical perspectives and include, among others, 
Clements (1986), Hind (1996), Bosch and de Jong (1997), Ladefoged et al. 
(1998), Bosch (1998), Smith (1999), Hall (2006), Hammond et al. (2014), 
Tosad (2015), Stanton and Zukoff (2018), and Morrison (2019). 

Experimental results bearing on the SG svarabhakti contrast can be briefly 
summarised as follows. Bosch and de Jong (1997) report some observations 
on 102 CVRVC structures excerpted from recordings of conversations with 
an elderly Barra Gaelic speaker. The most significant of their findings concern 
the durations of the first and second vowels of the VCV strings as well as their 
pitch contours. V1 was found to be longer in the organic sequences compared 
to svarabhakti ones while V2 had the opposite duration profile. As the authors 
observe, the first observation makes sense under Borgstrom’s proposal that 
the copy vowel extends V1 and shares its stress; but the ‘backwards’ VC.V 
parsing of organic sequences would naturally tend to shorten the tautosyllabic 
V1, contrary to their finding. Bosch and de Jong also identified an FO contrast 
between the two structures. Organic CVCVC sequences had a falling pitch 
contour starting at V1 while in epenthetic structures the FO peak tended to fall 
at the onset of V2. Ladefoged et al. (1998) collected and analysed data from 
11 elderly native speakers from the Lewis dialect. They confirm Borgstrom’s 
discovery that the svarabhakti sequences are judged as monosyllabic and 
document a falling pitch contour for organic ballag [palYak] ‘skull’ versus 
a sustained rising one for balg [pal¥ak] ‘belly’ comparable to Bosch and de 
Jong’s Barra speaker. Ladefoged and colleagues also suggest an alternative 
to Borgstrem’s analysis of the contrast as a difference in the affiliation of 
the medial consonant; they suggest that the svarabhakti sequence remains a 
monosyllable with the copy vowel simply extending the articulation of the 
stressed V1 through the sonorant consonant. On this proposal ballag parses as 
disyllabic V.CV while svarabhakti ba/g constitutes a single syllable with two 
vocalic sonority peaks. This structure expresses the native speaker perception 
of svarabhakti sequences as monosyllables directly. Finally, Hammond et 
al. (2014) investigated the svarabhakti contrast in several psycholinguistic 
judgement tasks with 18 SG native speakers (dialects not indicated). In one 
experiment speakers matched VCV sequences extracted from a corpus of 
spoken words like epenthetic [ara] of bargan ‘bargain’ versus the organic 
[ara] of marag ‘blood pudding’ with forced choice VCC versus VCVC 
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orthographic sequences. Speakers distinguished among the excerpted VCV 
stimuli at the margins of statistical significance (p > (|z|) = 0.04). Acoustic 
analysis of the stimuli confirmed earlier findings of differences in vowel 
length and pitch contour on the basis of which the subjects were presumably 
forming their judgements. Hammond et al. (2014) also conducted finger 
tapping tests of syllable count that found svarabhakti sequences to have less 
positive disyllable syllable responses compared to the organic structures. 
The speakers’ judgements were not categorical, however. Based on this 
observation the authors conclude that the svarabhakti vowels are present in the 
native speaker’s phonological consciousness and hence should not be treated 
as excrescent vowels as suggested by Hall (2006). 

The svarabhakti vowels have also been documented and analysed for 
various dialects of Modern Irish. Studies include Noyer (1990), Ni Chiosáin 
(1991), and Cyran (1996). Carnie (1994), Ni Chiosain (1999), and Fullwood 
(2013) focus on the particular consonant clusters that allow or disallow the 
epenthesis. In Irish distinctions among unstressed short vowels have been 
significantly curtailed compared to SG. In general, they are transcribed 
with schwa and are subject to considerable variation as a function of the 
surrounding consonants — in particular the co-articulatory effects of pala- 
talisation and velarisation. Nevertheless, the svarabhakti contrast exists in 
the orthography reflecting an historical sound change: dorn [taran] ‘fist’ 
versus doras [taras] ‘door’. Due to the vowel reduction Irish lacks the tell-tale 
copy-vowel sequences that mark the svarabhakti structures in SG and so raises 
the question of whether or not the orthographic distinction is merely a relic of 
the language’s history with no synchronic grammatical validity. An important 
observation that motivates drawing a phonological distinction between the 
svarabhakti and organic VCV sequences is the phenomenon of inflectional 
palatalisation. For many noun stems an oblique or plural form is marked by 
the insertion of palatalisation on the final consonant or consonant cluster of 
the stem. Due to the strong co-articulatory effect of palatalisation on preceding 
vowels, the distinction between the citation and inflected forms of such words 
is quite noticeable. Svarabhakti sequences show palatalisation effects across 
the entire sequence while organic sequences confine palatalisation to the 
stem-final rhyme: doirn [tari] versus dorais [tarif]. 

Assuming that the inflectional palatalisation as well as the orthographic 
difference provide sufficient grounds for postulating a phonological contrast 
between organic and svarabhakti VCV sequences, many of the same phono- 
logical and phonetic questions posed by the contrast in Scottish Gaelic carry 
over into Irish. Are the svarabhakti vowels copies of the stem vowel so that 
a syntagmatic correspondence holds between them that is simply masked 
by unstressed vowel reduction? Do the vowels in the VCV sequences differ 
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in duration and pitch as a function of the orthographic and corresponding 
palatalisation distinction? Finally, are there in fact consistent differences 
in palatalisation that justify drawing the distinction between the organic 
and svarabhakti sequences in the first place? In this note we address these 
questions based on data collected from a native speaker of the Connemara 
dialect of Galway in West Central Ireland. 


Methods 


Our data were collected from a female native speaker originating from Leitir 
Moir (Lettermore), a rural Connamara Gaeltacht in West Central Ireland. 
While residing in Boston, she makes frequent trips back to Galway to visit 
family and friends and has taught the Irish language and the traditional song 
repertoire for many years. The data for this study were recorded in a sound 
proof booth using professional equipment. Our corpus consists of c. 300 nouns 
elicited in list format over the course of three months in 2014. Many nouns 
were recorded as pairs with the citation form followed by the inflected form 
with stem-final palatalisation as reflected in the orthography. Several sets of 
words were constructed to investigate the questions noted above. 

Our data replicate most of the restrictions on the VRC sequences 
found to govern the svarabhakti vowels in SG and earlier studies of Irish. 
First, epenthesis is restricted to clusters formed by a liquid or nasal: arm 
[arom] ‘army’, calm [k"alam] ‘calm’, banbh [panof] ‘piglet’, but cosc [kask] 
‘restraint’. Second, it is blocked in homorganic clusters (geminate integrity): 
gorm [karam] ‘blue’, binb [pinap] ‘venom’, but dord [tort] ‘buzz’. Third, 
it is also blocked before a voiceless stop (realised with preaspiration): Jorg 
[larak] ‘trace’ but cearc [k"ar"k] ‘hen’ and dorcha [taraxe] ‘dark’. Finally, 
epenthesis does not occur if the preceding vowel is long or a diphthong: 
léargas [le:rgas] ‘insight’, téarma [t*e:rma] ‘term’, ailb [a:lp] ‘alb’. In 
this case, our speaker had a few words that violated this restriction: táirge 
[t'a:raka] ‘product’. 


Set 1 


The first set of words were designed to investigate the duration and vowel 
quality correlates of the organic versus (orthographic) epenthetic stems. 
Twenty-five nouns of each type were recorded in citation and palatalised 
inflected forms. The data were analysed in Praat (Boersma and Weenink 
2019). Praat textgrids were constructed to segment the VCV intervals based 
on visual inspection of the wave forms and the spectrograms. Praat scripts 


ELEMENTS, GOVERNMENT, AND LICENSING 


collected duration measurements for the vowels (V1 and V2) as well as the 


average values for their first and second formants. 


(1) CVCVC 
citation 
dollar 
donas 
doras 
coram 
asal 
camall 
camas 
calar 
capall 
coras 
dochar 
domhan 

foras 
galar 
iarann 
imeall 
inneall 
iomas 
ionad 
ionadh 
solas 
eolas 
ladar 
sionnach 
uasal 


palatalised gloss 
dollair dollar 


donais misfortune 
dorais door 
córaim quoram 
asail donkey 
camaill camel 
camais cove 
calair cholera 
capaill horse 
córais system 
dochair harm 


domhain world 


forais foundation 
galair disease 
iarainn iron 

imill border 
innill engine 
iomais intuition 
ionaid place 
ionaidh surprise 
solais light 
eolais knowledge 
ladair ladle 


sionnaich fox 
uasail noble 


CVCC 
citation 
dorn 
corn 
calm 
carn 
arm 
banbh 
bolb 
bolg 
colg 
colm 
dearg 
fearg 
gorm 
lorg 
marbh 
scolb 
scorn 
tarbh 
tolg 
borb 
dealg 
dealbh 
fearg 
leanbh 


ainm 


palatalised gloss 


doirn 
coirn 
cailm 
cairn 
airm 
bainbh 
boilb 
boilg 
coilg 
coilm 
deirg 


feirg 


goirm 
loirg 
mairbh 
scoilb 
scoirn 
tairbh 
toilg 
boirb 
deilg 
deailbh 


feirg 


linbh 


ainm 


fist 
horn 
calm 
pile 
army 
piglet 
caterpillar 
belly 
sword 
dove 
red 
anger 
blue 
trace 
dead 
chip 
disdain 
bull 
sofa 
fierce 
thorn 
statue 
anger 
child 
name 


The notch plots in Figure 17.1 show the distributions for the duration 


measures of the V1 and V2 vowels as a function of their status as organic 
(CVCVC) versus svarabhakti (CVCC) sequences. Comparing the two plots it 
is obvious that the durations of the stressed V1 vowels are significantly greater 
than V2 for both stem types. Within each plot the durations for V1 and V2 are 
somewhat shorter for the svarabhakti stems. But a linear regression found no 
significant difference for either V1 or V2: F-statistic: 0.9031 on 1 and 98 DF, 
p = 0.3443 for V1; and F-statistic: 1.438 on 1 and 98 DF, p = 0.2333 for V2. 
We conclude that unlike in SG, the organic and svarabhakti sequences are not 


significantly different in terms of their vowel lengths. 
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20- 
20- 
Ẹ Es- 
3 so- Š 
S Š 
1o- 
10- 
so- 


eve cevéve vec eveve 
shape shape 


Figure 17.1 Duration (in ms) of first (left plot) and second (right plot) vowels of the 
svarabhakti (CVCC) and organic (CVCVC) stems. 


_ = 


60 


R.dur 


ovce cvcve 
class 


Figure 17.2 Duration (in ms) of the medial nonrhotic sonorant consonant in 
svarabhakti (CVCC) and organic (CVCVC) stems. 


The plots in Figure 17.2 show the duration of the medial sonorant in 
the two types of VCV sequences. The rhotic [r] was excluded from this 
calculation since its segmentation from the surrounding vowels was less 
clearcut compared to the lateral and nasals. In this case, there is a significant 
difference with the sonorant from the svarabhakti CVRC stems being shorter 
(F-statistic: 26.12 on 2 and 57 DF, p < 0.001). 

The plot in Figure 17.3 shows the F1*F2 distribution of the reduced V2 
stem vowels as a function of the organic versus svarabhakti classification. 
It is clear that there is no overall separation between the two stem types as 
indicated by the overlapping confidence intervals. Both types show an inverse 
relation between the first and second formants such that as F1 decreases, F2 
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Figure 17.3 First (y axis) and second (x axis) formant values (in Hz) for the second 
(reduced) vowel in svarabhakti (CVCC) and organic (CVCVC) stems. 


increases. This distribution suggests that there is just a single tongue body 
target for these unstressed reduced vowels whose height and backness is 
presumably determined by the surrounding consonants. 

In sum, it appears that the Modern Irish neutralisation of vowel contrasts 
in unstressed syllables has wiped out any length and timbre distinctions in 
the vowels that might have existed between the organic and svarabhakti 
sequences comparable to what has been documented for SG. On the other 
hand, the duration of medial sonorant itself may help to distinguish the two 
stem types. 


Set 1: pitch 


We also investigated the FO contours for the svarabhakti stems and compared 
them with the corresponding CVCVC disyllables and CVCC monosyllables. 
Forty-two words of each type were selected for analysis. We chose organic 
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words where the medial consonant is a sonorant in order to better match the 
svarabhakti stems. The VCV intervals were marked in the Praat textgrids and 
then submitted to the time normalised FO analysis in Prosody Pro (Xu 2013). 
The analysis window was set to 15 sections and the program extracted average 
FO values for each sector. These were then averaged across the 42 samples 
for each word type. The results are displayed in the chart below. The chart 
suggests that in the svarabhakti stems the FO peak is reached considerably 
earlier compared to the organic disyllables, which have a flatter more sustained 
summit that drops off at the start of the second syllable. The earlier FO peak in 
the svarabhakti sequences mimics what is found in monosyllables, which also 
begin falling at roughly the midpoint of their (single) vowel. This pitch contour 
is quite different from what has been found for SG where it is the svarabhakti 
stems that display the delayed peak reached at the onset of the copied vowel. 


Set 2: inflectional palatalisation 


We recall that the major motivation for drawing a synchronic distinction 
between the svarabhakti and organic sequences in Irish is their contrasting 
behaviour under inflectional palatalisation. This process palatalises the 
stem-final consonant or consonant cluster in certain oblique/plural forms of a 
lexically determined class of nouns. For organic CVCVC stems just the final 
consonant seats palatalisation. But in svarabhakti CVRC stems, both the R 


—cvceve ——cvrec monosyll 


270 


170 


1 1 3 5 7 9 1 13 15 


Figure 17.4 FO plots of time-normalised organic (CVCVC), svarabhakti (CVRC) 
and monosyllabic stems. 
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and the final C are palatalised even though on the surface they are separated by 
a schwa that at least phonetically is no different from the underlying reduced 
vowel in the organic CVCVC stems. Examples of the two types appear below. 


(2) citation inflected gloss 
calar  [kalor] calair [kalor] cholera 
donas [tanəs] donais [tanof] misfortune 
foras [farəs] forais [farəf] foundation 
colm  [k"alom] coilm [k"aliom] dove 
banbh [panəf] bainbh [paniəf] piglet 
gorm [karom] goirm [karom!] blue 


In order to corroborate this distinction two additional sets of words were 
constructed. The first consisted of CVRC svarabhakti stems with a medial 
lateral or nasal paired with a comparable number of organic stems with a 
medial nasal or lateral (N = 13 in each case). Stems in [r] were excluded 
because of difficulties in accurately segmenting the consonant from the 
surrounding vowels. The second and larger set included [r] but restricted the 
stressed initial vowel to /a/ or /o/. The presence or absence of palatalisation on 
the medial consonant was then inferred on the basis of its presumed effect on 
the F2 value of the preceding vowel. 

The plot in Figure 17.5 from the data in the first set clearly establishes 
that the medial sonorant in the svarabhakti stems is palatalised and hence 
distinct from the comparable consonant in the organic CVRVC stems. 

The plots in Figure 17.6 show the distribution of the F2 values for V1 
in the second set of noun stems that include the sonorant [r]. They suggest 
that in the basic citation form context there is no significant difference as a 
function of stem shape (CVCC svarabhakti versus CVCVC organic). This 
point was corroborated by linear regression (F-statistic: 1.716 on 1 and 50 DF, 
p = 0.1962). On the other hand, in the ‘derived’ context of stem-final pala- 
talisation the F2 of the first stem vowel was significantly greater (F-statistic: 
35.94 on 1 and 43 DF, p < 0.001). 

In sum, the data confirm the observation from the earlier literature that 
the medial sonorant consonant in the svarabhakti stems is palatalised while the 
corresponding consonant in the organic CVCVC stems is not even though in 
both cases these consonants are separated from the stem-final consonant by a 
full-fledged schwa vowel. In combination with suggestive differences in the 
duration of the medial consonant and its contribution to the FO pitch contour 
observed in sections ‘Set 1’ and ‘Set 1: pitch’ of this chapter, they indicate that 
the svarabhakti vowels are not just a relic from the history of the language but 
also a synchronic reality which should be expressed by the grammar. 


A NOTE ON THE SVARABHAKTI VOWELS IN CONNEMARA IRISH 


193 


1750 - 


1250- 


cvec cveve 
class 


Figure 17.5 Second formant values (in Hz) of the medial sonorant consonant in 
svarabhakti (CVCC) and organic (CVCVC) stems. 
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Figure 17.6 Second formant values (in Hz) of the initial vowel in citation (left 
side) and inflected palatalised form (right side) of svarabhakti (CVCC) and organic 
(CVCVC) stems. 
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Discussion and conclusion 


To summarise the results of this study, we investigated the phonetic reflexes 
of the svarabhakti vowels in the speech of a native speaker of Connemara 
Irish. Our speaker consistently distinguished the underlying VRC versus 
disyllabic VCVC sequences in terms of inflectional palatalisation. But unlike 
in SG, the vowels of the two structures were not distinguished by surface 
phonetic duration or quality. We did, however, uncover a distinction in 
the duration of the medial sonorant, which was shorter in the svarabhakti 
stems. We also found suggestive evidence that the svarabhakti stems had 
an earlier pitch fall compared to the organic VCV sequences. Finally, when 
informally quizzed for intuitions of syllable count, our speaker did not clearly 
distinguish the two sequences — sometimes counting them as one syllable and 
at other times as two. 

The literature on the svarabhakti contrast in both Scottish Gaelic 
and Modern Irish has seen a variety of proposals on how to account for 
the distinction. As far as the generalisations uncovered in this study are 
concerned, we believe the most straightforward analysis would assign the 
factors enumerated above at the derivational stage that precedes epenthesis. 
At this stage doras [ta.ras] ‘door’ would be represented as disyllabic while 
dorn [ta.ron] ‘fist? would be parsed as a bimoraic monosyllable /torn/. 
Inflectional palatalisation over the stem-final syllable rhyme targets the entire 
VRC sequence in dorn but just the final VC in doras. This monosyllabic stage 
provides a natural way to distinguish the duration and pitch differences as 
well. In /torn/ the medial sonorant shares its mora with the final consonant 
and hence will see its portion of the duration accorded to the mora reduced. 
Finally, the bimoraic stressed syllable of /torn/ could also seat both tones 
comprising the *HL pitch accent accounting for the earlier pitch fall compared 
to disyllabic doras. 

It almost goes without saying that one must be very cautious in drawing 
any conclusions based on data from a single speaker. But we think it is fair 
to conclude that the questions posed here and the generalisations derived, 
tentative though they may be, certainly merit further investigation. 


Note 


1. Thanks to Anton Kukhto and an anonymous reviewer for helpful comments and to Mairin Keady for 
sharing her language. 


A NOTE ON THE SVARABHAKTI VOWELS IN CONNEMARA IRISH 


195 


196 


References 


Boersma, P., and Weenink, D. (2019). Praat: Doing phonetics by computer. Computer software, 
Version 6.1. https://www.praat.org. 

Borgstrom, C. (1937). ‘The dialect of Barra in the Outer Hebrides’. Norsk Tidsskrift for Sprogvidenskap 8, 
71-242. 

Borgstrom, C. (1940). A Linguistic Survey of the Gaelic Dialects of Scotland. Vol. 1: The dialects of the 
Outer Hebrides. Oslo: Aschehoug. 

Borgstrom, C. (1941). A Linguistic Survey of the Gaelic Dialects of Scotland. Vol. 2: The dialects of Skye 
and Ross-shire. Oslo: Norwegian Universities Press. 

Bosch, A. (1998). ‘The syllable in Scottish Gaelic dialect studies’. Scottish Gaelic Studies 18, 1—22. 

Bosch, A., and de Jong, K. (1997). ‘The prosody of Barra Gaelic epenthetic vowels’. Studies in the 
Linguistic Sciences 27(1), 1-15. 

Carnie, A. (1994). ‘Whence sonority? Evidence from epenthesis in Modern Irish’. Papers in Phonology 
and Morphology 21, 81—108. 

Clements, G. N. (1986). ‘Syllabification and epenthesis in the Barra dialect of Gaelic’. In The 
Phonological Representation of Suprasegmentals, edited by K. Borgers, H. van der Hulst, and 
M. Mous, 317-36. Dordrecht: Foris. 

Cyran, E. (1996). ‘Licensing properties of nuclei and principle ranking in Irish’. The Linguistic Review 
13, 1-31. 

Fullwood, M. (2013). ‘The perceptual dimensions of sonority-driven epenthesis’. General Examination 
Paper, MIT Department of Linguistics. 

Hall, N. (2006). ‘Cross-linguistic patterns of vowel intrusion’. Phonology 23, 387-429. 

Hammond, M., Warner, N., Davis, A., Carnie, A., Archangeli, D., and Fisher, M. (2014). ‘Vowel insertion 
in Scottish Gaelic’. Phonology 31, 123-53. 

Hind, K. (1996). ‘The structure of epenthesis in Gaelic’. Journal of Celtic Linguistics 5, 91—119. 

Iosad, P. (2015). ‘Pitch accent and prosodic structure in Scottish Gaelic: Reassessing the role of contact’. 
In New Trends in Nordic and General Linguistics, edited by M. Hilpert, J.-A. Ostman, C. Mertzlufft, 
M. Rießler, and J. Duke, 28—54. Berlin: De Gruyter. 

Kenstowicz, M., and Kisseberth, C. (1979). Generative Phonology: Description and theory. New York: 
Academic Press. 

Ladefoged, P., Ladefoged, J., Turk, A., Hind, K., and Skilton, J. (1998). ‘Phonetic structures of Scottish 
Gaelic’. Journal of the International Phonetic Association 28, 1-41. 

Morrison, D. (2019). ‘Metrical structure in Scottish Gaelic: Tonal accent, glottalization, and overlength’. 
Phonology 36, 391—432. 

Ni Chiosáin, M. (1991). Topics in the Phonology of Irish. Doctoral dissertation, University of 
Massachusetts. 

Ni Chiosain, M. (1999). ‘Syllables and phonotactics in Irish’. In The Syllable: Views and facts, edited by 
H. van der Hulst and N. Ritter, 551-75. Berlin: De Gruyter. 

Noyer, R. (1990). ‘Secondary epenthesis and stress in Munster Irish’. Proceedings of the Harvard Celtic 
Colloquium 10, 1-23. 

Oftedal, M. (1956). A Linguistic Survey of the Gaelic Dialects of Scotland, Vol. 3: The Gaelic of Leurbost, 
Isle of Lewis. Oslo: Aschehoug. 

Smith, N. (1999). ‘A preliminary account of some aspects of Leurbost Gaelic syllable structure’. In The 
Syllable: Views and facts, edited by H. van der Hulst and N. Ritter, 577—630. Berlin: De Gruyter. 

Stanton, J., and Zukoff, S. (2018). ‘Prosodic identity in copy epenthesis: Evidence for a correspondence- 
based approach’. Natural Language & Linguistic Theory 36, 637-84. 

Xu, Y. (2013). ProsodyPro: A Praat script for large-scale systematic analysis of continuous prosodic 
events. Computer software. http://www.homepages.ucl.ac.uk/~uclyyix/. 


ELEMENTS, GOVERNMENT, AND LICENSING 


18 


Domino effects and licensing chains 
in government licensing: sequential 
NC clusters in Bantu 


Nancy C. Kula 


Introduction 


Co-occurrence restrictions in sequential nasal-consonant clusters (henceforth 
NC clusters) in Bantu is a well-known phonotactic effect owing to Meinhof 
(1932) and thus appropriately christened Meinhof’s Law (ML). Meeussen 
(1962) and Schadeberg (1982, 1987) provide descriptions and discussion. 
Various works in the phonological literature provide different analyses 
(Herbert 1986; Katamba and Hyman 1991; Kula 2006; Peng 2007). ML is 
akin in nature to Japanese Rendaku (Ito and Mester 1986; Nasukawa 2012) but 
differs from it, as while there is no restriction on the adjacency of the segments 
involved in Rendaku, some notion of adjacency is required in ML. In its most 
general form, ML can be characterised as a process that disallows a sequence 
of two voiced obstruents within NC clusters, that is, *NCVvNC where both Cs 
are voiced. 

From a Government Phonology (GP)/Element Theory perspective, I 
have argued that the restriction *voiced NCvNC is explainable by licensing 
relations as defined in a phonological domain (Kula 2002, 2006). Most of 
what follows here is based on that work highlighting the central argument 
that the restriction results from the failure of government licensing (as 
defined in Charette 1990, 1991) to apply, when there are competing licensing 
roles within a phonological domain, here consistent with a prosodic word. 
Another way of framing the question is, how many ‘government licenses’ 
can be offered within one interconnected domain? If this approach is on the 
right track, then it also, although only tangentially, brings us full circle to 
confirming the representations of N+C sequences as clusters in the Bantu 
languages that show ML, and begins to touch on how word maximality may 
be defined. 
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ML sound patterns across Bantu 


A short overview of ML is given here, with further details accessible in 
the various works cited. In its simplest form the rule changes the first or 
second NC cluster in a sequence of two, depending on the language, to 
either a simple homorganic nasal or a nasal geminate. There is variation 
as to whether the trigger can also be a simplex nasal (for example, 
as occurs in Luganda, Kikuyu, and Umbundu) that we discuss only 
briefly here. (1), below, gives examples from Luganda (Ashton et al. 
1954; Cole 1967), Lamba (Doke 1922) and Bemba (van Sambeek 1955). 
(1a,b) shows that the process only affects voiced NCs (N-1 — /nd/ in 1b), 
with voiceless ones (1c) showing no simplification. (2) gives a fuller set 
of data from Bemba to fully illustrate the pattern within one language.! 
In (2d—e) we see that the process also does not apply when one of the two 
NCs is voiceless irrespective of its position, and (2f) shows the signifi- 
cance of adjacency with no ML applying when the two voiced NCs are not 
adjacent. 


(1) a N-genda — penda ‘Igo’ *ngenda (Luganda) 
b. iN-lembo — inembo ‘tattoo’ *indembo (Lamba) 


c. N-tampa — ntampa ‘Ibegin’ *nampa (Bemba) 


(2) ML in the Bemba perfective (-ile ending) 


a. n-Bd:mbele — mo:mbel-e *mbó:mbele ‘I have worked’ 
b. n-la:ndile — na:ndil-e *nda:ndile ‘I have spoken’ 
c. n-tampile — nta:mpile *na:rmpile ‘Ihave started’ 
d. n-pa:ngile — mpa:ngil-e *ma:ngile ‘Ihave made’ 

e. n-lu:ntwiile — ndu:ntwiile *nu:ntwiile ‘I have bumped’ 
f. n-Bélengele — mbéle:ngel-e *méle:ngele ‘I have read’ 


We can thus summarise the markedness relations of sequential NCs as in 
(3) below: 


(3) root C; C, 
a. C,VC,- voiceless voiceless most unmarked 
b. C,VNC,- voiceless voiceless 
c. C,VNC,- voiceless voiced 
d. NC,VNC,- voiceless voiceless 
e. NC,VNC,- voiceless voiced 
f. NC,VNC,- voiced voiceless 


*g. NC, VNC,- voiced voiced most marked 
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Licensing chains: the domino effect 


I will assume Strict CV in the following discussion (Lowenstamm 1996) 
and assume, following Scheer (2004), that government and licensing are 
the relations that define the syntagmatic relations between Cs and Vs to 
generate surface linear order building on standard GP work (Kaye et al. 
(KLV) 1990). The starting point is the licensing principle mandating that 
all positions within a phonological domain have to be licensed, apart 
from the head, which is itself the source of licensing potential, usually a 
dominant nucleus — V position in current currency. To recap and recast 
standard assumptions (Charette 1990, 1991) the head V position licenses 
all other V positions in the domain, which in turn license the C positions 
in their CV pairs. The precise manner in which such licensing applies is 
nicely exemplified in Harris’ (1997) Licensing Inheritance Principle as 
given in (4). 


(4) Licensing inheritance principle 
A licensing position inherits its autosegmental licensing potential from 
its licensor. 
Prosodic licensing and autosegmental licensing 
Prosodic licensing sanctions the presence of positions at different 
levels of projection from the skeletal tier upwards. Autosegmental 
licensing determines the melodic content of a particular position. 


The domino effect of the LIP and licensing through a domain as discussed 
above is exemplified in (5) below. Assuming V, to be the head, it licenses V, 
and then V, by domino effect. V positions then license C positions in their 
CV pairs: 


(5) V- and C-licensing 


V-licensing 


i a ry! 


CV; ©, VW, G V3 
ae C-licensing 


We can refine this idea to distinguish (i) the licensing of positions versus 
(ii) a position acquiring licensing potential so that it can itself be a licensor. 
Thus, in (5) all positions are dually licensed to be licit entities and are further 
licensed to license either melodic material (autosegmental licensing) or 
processes. Further, maximising this idea, we will assume that every process/ 
activity within a phonological domain must be licensed in some way. 
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This in a way takes to heart what is already implied in the non-arbitrariness 
fundamental principle in KLV (Kaye et al. 1990, 194), requiring a non- 
arbitrary relation between a phonological process and the context in which 
it occurs. This then, coming back to ML, raises the question of licensing 
potential. How much licensing potential does a position have, and can we in 
some way, think of this potential as deteriorating as more licensing tasks are 
performed, so that a saturation point when no more licensing can be done is 
reached? This is the basic idea pursued here.” ML seems like good ground 
to test this idea as it offers us the phonotactic refinement given in (3) where 
voiceless only and a voiced-voiceless mix of NCs is acceptable but two 
voiced NCs is not. So that it is not just about the complexity of having two 
NCs. In addition, the locality restrictions also suggest proximity to potential 
licensors as playing a role. 


Types of licensing and their relations 


Before getting to the analysis, we need to make a few more assumptions 
in the context of enriched licensing (see also discussion in Cyran 2010 
on this). I will assume three types of licensing as categorised below and 
also that they move from being universal and across the board to being 
more language specific, which we can capture by parameter. Type I 
involves licensing positions which every language will do. Type II and 
IL are both going to involve parametric choices, with the former focused 
on the licensing of constituents while the latter deals with content both 
in terms of melody and processes. There is a natural hierarchy only in 
the sense that we assume a phonology needs positions which can then 
interact to create different syllable structure complexities and which 
themselves host melody that interacts in different ways. This is captured 
in (6) below, with (6b) giving an example of how syllable complexity 
emerges as different licensing relations are allowed in a language. The 
contrast between the presence of TR versus RT clusters in a language, for 
example, follows from which licensing relations are allowed in Type I 


licensing. 

(6) a. Type I » Type IL » Type Il 
Basic (universal) Less basic Language specific 
position licensing government licensing licensing processes 
V positions licensing gov. licensors licensing changes in 
C positions p-licensing melodic structure 
b. CV » CC(TR) » CC(RT) » (processes) 
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In (7) we see that V-licensing, that is, licensing that emanates from a V 
position, can be either local in targeting other V positions or non-local 
interacting with C positions within its CV unit. 


(7) 
within its domain (nuclear level) — local licensing 
a. V-licensing 


outside its domain (CV level) — non-local licensing 


b. local licensing » non-local licensing 


We can thus trace a licensing path in some language (x) with positive 
parameter settings as yielding the licensing hierarchy in (8): 


(8) Hierarchy of licensing functions in a domain: 
local licensing » non-local licensing » licensing government licensors » 
government licensing » p-licensing » licensing processes {p!...p"} 


NC simplification as a result of licensing demand 


The process of NC simplification or ML that we have seen applies in a 
context that involves consonant hardening (1d, b—b) and voicing. In fact, 
these two processes cannot be told apart in such cases (see examples in (2)) 
as the result of hardening is a voiced obstruent. But other Bantu languages 
allow us to distinguish these are two separate processes when post-nasal 
voicing applies to voiceless obstruents. Considering NC simplification in 
examples like (2b) we can capture the process as in (9) below. Following 
assumptions on the commutability of nasality and voicing as now accepted 
in Element Theory (see, for example, Nasukawa 1998; Botma et al. 2011), 
voicing is treated as the sharing of the |L| nasal element in the nasal prefix 
with the following sonorant /I/ (|?| must also be shared from the nasal to 
effect hardening). za representation of voicing is simplified here for pres- 
entation purposes and is represented as a change of head from a headed |L| 
in the nasal prefix to a non-head |L| in the resulting voiced obstruent.? A 
process that Charette and Göksel (1998) refer to as ‘switching’. We assume 
a government relation between C, and C, that renders the intervening V 
position inert. 
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(9) input output 1 output 2 


Ci VG... = CG, VG... 7 Q FV Ciz 
| | | | | | 
L L ————> L L —> L 
/N/ N /n/ /d/ /n/ /n/ 


The input is the nasal with |L| head in C,, which spreads to C, to result in 
voicing (or at least should), but then output 2 is the final form where nasal |L] 
spreads but fails to change its head status. ML or NC simplification therefore 
amounts to the failure of head switching (or failure to assume a different 
structural position allowing a voicing interpretation). The central question is 
thus why ‘switching’ should fail to take place? And importantly how can we 
represent this failure? 

Licensing restrictions have been expressed in GP with respect to 
undergoers being unable to be triggers. Thus Charette 1991 argues that 
properly governed positions may not themselves be proper governors, and 
similarly Szigetvari and Dienes (1999) showed that governed positions 
may not themselves govern. On the back of these restrictions I argue in 
Kula (2006) that there are also restrictions on triggers so that triggers may 
not be multiple triggers. In this respect governors and proper governors 
may not be multiple governors and proper governors, respectively. I revise 
this idea here and cast it firstly as parametric and secondly as defining 
additional kinds of licensing that present demands on a head nucleus. We 
can thus consider that NC clusters require to be licensed by drawing on: 
Type I licensing of constituents, government licensing to define the NC 
sequence, and that Vs will need to be licensed to perform each of these 
different tasks. 

Under this reasoning we can account for ML/NC simplification as 
resulting from the inability of one V position, amidst other competing 
licensing demands, to multiply license government that is needed to maintain 
an NC cluster. We assume the head V position in a Bantu verb is the root V 
position which may in full verb forms be preceded by TAM markers. This 
position of the head is crucial, indeed supported by the patterning of ML 
where it is sandwiched between the two NC sequences. Recall that we are 
assuming that all relations and processes must be licensed in some way and 
therefore in this context only a domain head V can inherently be a government 
licensor and any other V position needs to specifically be licensed to be a 
government licensor apart from other licensing, in particular local/basic Type 
I licensing that sanctions the existence of constituents. The ungrammaticality 
of (11) then follows from the inability of V, to both license government 
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licensing in V, and simultaneously directly government license C, to then 
govern C, to create the NC cluster. 


(11) *NCVNCv... 


*gl lgl 
y | ý (local » non-local) 
C yG Vy G Vz Cy Va 
LOL EOP o ooo 
x x X x x xX x x 
| or | | | 
N C v N C. Van 
A ' A 
Sa? Pye cae government 


The other licensing relations involved in this representation are touched on 
below, but so then here the failure of V, to government license C, results 
in simplification. This would be the Bemba case in the examples in (2) 
where local licensing precedes non-local licensing for government licensing. 
We must assume that this choice is parametric because in languages like 
Kwanyama (Bantu, Namibia) (Schadeberg 1982) it is the second NC in a 
sequence that simplifies, as (12) shows, but also it is the nasal rather than the C 
that is lost. There are a few assumptions we need to make to get this, but space 
precludes further discussion (see Kula 2006). 


(12) ML in Kwanyama: ngombe — ongobe ‘cattle’ 


gl *]gl 
y | t (non-local » local) 

C V CG, V C V Cy Vae 

Loko] E eh 

x x xX x x xXx x x 

| | | | | | 
. N C v N C V 

A | x” ; 

Pe ieee S government 


The reason why only the restriction on triggers does not capture the distribu- 
tion, that is, that a government licensor (a V position) cannot also be a licensor 
of government licensing in another V, is because not all sequences on NCs are 
disallowed. Two voiceless, or one voiced and the other voiceless are allowed. 
These cases would need both clusters to be government licensed and we 
assume that they are as in (13). 
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(13) ML w Voiceless NCs (example 1c): ntampa — ntampa ‘I start’ 


*nampa 
gl lgl 
C VGV C V Q Ve 
Ooo po p p f 
x X X X x xX x x 
| on | | 
N C y N C Ve 
A g C. 


sser Sasat government 


The motivation then for *NCvNC when the two NCs are voiced is to do with 
the additional licensing of voicing. Recall that this involves a process of |L| 
spreading from the nasal to the obstruent, treated here as involving ‘switching’ 
which fails to be licensed to take place under licensing demands. This is not 
the case in (13) and so both clusters can be successfully licensed. 

The other well-known pattern in ML/NC simplification is cases where 
the rule applies before a simplex nasal as in Luganda (also referred to as the 
Ganda Law). We can relate this to a co-occurrence restriction that bars a 
non-head |L| in the presence of a headed |L]. In the current discussion, this is 
to express that ‘switching’ fails to be licensed in the presence of an |L|-head 
in the domain. This is a mirror image of the process involved in the general 
version of ML and can be regarded as within expected parametric variation, 
expressed in the following co-occurrence restrictions: 


(14) MLI: presence of ‘switching’ in a domain bars another application of 
‘switching’ 
ML2: presence of an |L| head in a domain requires any other |L| to 
also be head. 


A final point to consider is why there is no simplification when the two voiced 
NCs are not adjacent, as example (2f) shows, that is, *NCVNCv with voiced 
obstruents but NCvCvNCv also with voiced obstruents is grammatical. The 
argument is that in (15) the head V, is not directly involved in licensing 
the government licensing of the following voiced NC cluster because it is 
not immediately adjacent to it. V, only licenses V, on its right to exist as a 
constituent, V, then licenses V, which licenses government licensing, the 
constituent, and |L| head switching to allow the voiced obstruent. V, is itself 
not a government licensor and as such has enough licensing capacity to pass 
on to V,. This implies that it is more about how much licensing a particular 


ELEMENTS, GOVERNMENT, AND LICENSING 


V position is able to sanction, here V, which is hence relieved when no NC 
immediately follows, and not so much about the overall amount of licensing 
available to a domain. Naturally, the amount of licensing in a domain is 
impacted upon by having restrictions on what a single V can do, but only in 
this indirect way. 


(15) Grammatical NCvCvNCv (Bemba mbelenga ‘I read’) 


Sele! S- government 


To summarise, we account for ML/NC simplification as the failure of 
government to take place because the potential governing head fails to be 
licensed to be a governor. More specifically this must be interpreted as the 
failure to license ‘switching’ under |L| spreading from the nasal to the stem- 
initial position. Under this view we can then explain why voiceless NC’s 
never undergo ML — switching never has to be licensed in these clusters. 
Similarly, a sequence of voiced and voiceless NCs is also possible because 
switching is only licensed once in the voiced NC. It is crucial that the same 
V position is implicated in the licensing of switching, since if it is not, 
simplification does not occur as seen in examples where the two NCs are not 
immediately in sequence. This explains why place assimilation is still seen 
to apply. 


Defining saturation points 


A natural consideration to finally make is whether licensing within a domain 
can be defined globally and by so doing feed into the possibility of defining 
limits on word size in language, given the possibility of word minimality 
and possibly maximality effects. This would require us to consider licensing 
saturation points that define restrictions on word size per language, defined 
over the whole phonological domain/V positions. 
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The distinction made between more basic and less basic licensing 
functions would help us distinguish between less marked and more marked 
structure. This means that simple syllable structures that only require basic 
licensing, such as a CV language type, require less licensing than languages 
with CC clusters that also need Type II licensing. A characterisation of 
saturation over a domain must therefore oppose the syllable structure type 
with licensing potential in relation to domain size. This is depicted in (16) 
where SS stands for syllable structure complexity, LP for licensing potential, 
and DS for domain size. 


(16) Defining saturation points 
a. SS (x) LP (z) DS (licensing path) (y) 
(i) increase increase decrease 
(ii) decrease decrease increase 
b. (i) z=x+y where z = (x—1) + (y+1) 
Gi) LP=SS|+DSt or LP=SSt+ DS] 
(iii) z=k? 


In (16a, (i)) increasing the complexity of SS also increases the LP needed 
to license such structure in which case LP will be depleted quicker from 
the greater demand resulting in a shorter (but more complex) licensing 
path and hence a shorter phonological domain. The converse relation 
holds for (16a, (ii)); a decreased complexity in SS requires less LP to be 
sanctioned, and thus can accommodate more structure which means an 
increase in phonological domain size. The question for the characterisation 
of licensing saturation, expressed in (16b, (iii)), is whether we can equate 
LP to a constant (k) and, further, whether such a constant should be deemed 
universal. 

Under such a view of licensing saturation we could regard ML as 
resulting from a saturation point being reached in the resolution of an increase 
in SS complexity (the need to license voiced NCs) which implies more 
licensing functions that demand increased LP. 


Conclusion 


Licensing Inheritance suggests that licensing potential is spread in varying 
degrees in a phonological domain: recessive V positions depend on preceding 
V positions for their licensing tasks. It is argued here that because of this, a 
V position that has itself licensed a preceding or following C position to be a 
governor cannot also license another V position for the same task. By virtue 
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of its own action, a government licensor has depleted its licensing potential 
for further tasks of the same nature, termed here as licensing saturation. This 
failure to license government after depletion of licensing potential has been 
interpreted as the failure to license ‘switching’ in NC clusters in Bantu that 
then gives rise to the ML/NC simplification effects. The analysis requires the 
treatment of licensing as of three different types that stand in an implicational 
relation — giving explanation to why CV is the simplest syllable type. 

An alternative version of licensing saturation, which assumes an upper 
limit on licensing potential, draws a relation between syllable structure 
complexity and phonological domain size, so that a flat licensing path has 
more potential for increased phonological domain size than one where clusters 
must be licensed. It remains to be seen whether there is empirical evidence to 
support such a view. 


Notes 


1. The initial N- represents the 1*'sg. subject marker and (i)N- the Class 9/10 noun class marker. 
Consonant hardening in (1b) also applies in (2a—b) and is a regular standard process in this context. 

2. This follows Kula (2006). In recent OT work this idea is very similar to Zimmermann (2018). 

3. My assumption is that element geometries, building on dependency relations, is what captures the 
interpretation of |L| as nasality, voicing, or low tone. See Kula (2002), Botma (2004), Liu and Kula 
(2020) for discussion. ‘Switching’ will be used as a shorthand for this. 

4. In earlier versions of GP this was, for government, expressed at the constituent level by having binary 
branching structures so that a governor could only govern one dependent. 
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CaCj in French 
Tobias Scheer 


Charette’s analysis of CaCj 


Charette (1998, 171-3; 2003, 471-5) wonders why schwa may be left 
unpronounced in [CaCj] cimetière (cim’tjére) ‘cemetery’, but not in [CaCj] 
atelier (*at’ljer) ‘workshop’ (this pattern is also analysed by Cavirani in 
Chapter 15).! In these articles (173/473ff.), Charette ascribes the contrast 
to the type of CC cluster created if schwa were left out: schwa may only 
drop if this cluster is a good coda-onset sequence in Standard Government 
Phonology (GP) (Kaye et al. (KLV) 1990; Charette 1991). This amounts 
to any cluster which exists in intervocalic position and is different from a 
branching onset (obstruent-liquid TR) and a bogus cluster (tl, dl). In the 
[CaCj] pattern, though, the cluster created by the non-pronunciation of schwa 
([mt] in cim tière) cannot be a coda-onset sequence since it encloses a schwa 
in the lexicon and resyllabification is prohibited in Government Phonology 
(Kaye et al. 1990, 221). Therefore, Charette argues, the CCs at hand are in 
fact onset-onset clusters that afford a right-headed onset-to-onset government 
whose definition is the same as the one that holds within coda-onset 
sequences: any intervocalic cluster that exists in the language qualifies except 
TR and bogus [tl, dl]. 


(1) CaCj (Charette 1998, 2003) 
a. cimetière b. atelier 
IO-Gvt PG PG 


SNL NA A N 


ONON ON ON ON ONONONON 
Mh 2 | M ge | | o | | 
c im t I er ato l I eœ 


ə 
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Thus in cim’tiére under (la), the [m] and the [t] are onsets and [t] governs 
[m], thereby circumscribing the empty nucleus enclosed which for that reason 
is rightfully empty: [metejV] is well-formed since both empty nuclei are 
taken care of, the first by onset-onset government, the second by government 
from the following V. In *at’/ier under (1b) on the other hand, being bogus, 
[t] and [l] are unable to establish interonset government and therefore 
schwa cannot drop when followed by another empty nucleus: *[telojV] is 
ill-formed because the first empty nucleus is neither governed nor enclosed 
in a governing domain (while the second empty nucleus is governed by the 
following V as before). 

Charette’s analysis of [CaCj] is thus based on the management of the 
CC cluster that is left when schwa remains unpronounced, knowing that it 
encloses an empty nucleus and is followed by another empty nucleus in all 
cases: in Charette’s analysis, [Cj] identifies as /Coj/ whose empty nucleus is 
governed.” 


Charette’s earlier analysis of CaCj 


Both data and analysis presented in the previous section have changed since 
Charette (1991, 115ff.) first studied [CaCj]. In 1991, she compares Quebec 
and Parisian French, reporting that while schwa is always realised in the 
former (cim/a/tiére, at{a]lier), it may remain (or actually is) unpronounced in 
the latter (cim tière, at’lier). That is, Charette does not make any difference 
between the cimetiére type and the atelier type. 

The contrasting behaviour of the two types had been observed by 
1991, though, but as far as I can see only by Dell (1973, 262ff.) for Parisian 
French: schwa may be left out in the cimetiére type, but not in the atelier 
type. In her 1998 and 2003 articles, Charette acknowledges this difference 
also for Quebec French: she now reports that schwa may or may not be 
realised in the cimetiére type, while (as in Parisian French) it is mandatory 
in the atelier type. 

Regarding the analysis, in 1991 Charette argues that schwa cannot drop 
in Quebec French because the following Cj is a branching onset 2a, but that it 
may be left out in Parisian French since the yod belongs to a complex (light) 
nucleus 2b (see Péchtrager’s Chapter 12 on light diphthongs in general and in 
French). 
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(2) CaCj (Charette 1991, 115ff.) 


a. Quebec French b. Parisian French (schwa 
(schwa mandatory) may be left unpronounced) 
AA 2 
ONONON ON ON ONON ON 
Pol bw PY i oe ee 
x X X XXXXK XX x X KX KX XX XK XK X 
Po Pe ea | | | Il || 
c i mottlie r c im tIer 
a t ollie a t IIe 


Under (2a), the nucleus containing schwa cannot be reached by government 
from the following nucleus since in Standard GP government cannot jump 
over a branching onset (Charette 1991, 113ff.). Therefore, the ungoverned 
nucleus must be realised and schwa is pronounced. By contrast under (2b), 
only a single consonant separates the schwa-containing and the following 
nucleus: the yod is part of the latter. Thus nothing prevents government falling 
on the nucleus containing schwa, which therefore may be left unpronounced. 
As was mentioned, this analysis is based on incorrect data: in both 
varieties, there is a crucial difference between the cimetière type and the atelier 
type of words. But the analytic options are interesting: in [CaCj], schwa is 
mandatory when the yod belongs to a branching onset, while it may be left out 
in case the yod belongs to a complex nucleus. This contrasts with the analysis 
discussed in the previous section where [Cj] identifies as two independent 
onsets. Also note that in Charette’s 1991 analysis, the syllabic status of the 
[Cj] cluster is responsible for the behaviour of schwa, while in her 1998/2003 
analysis the syllabic identity of the CC cluster drives schwa deletion. It will be 
shown in the ensuing section ‘Empirical situation’ that the former is correct. 


Syllabic identity of Cj 


Complex nucleus 


The complex nucleus analysis is a consequence of the incorrect data that 
Charette’s 1991 analysis was based on: it accounts for the pattern where schwa 
in [CaCj] can be dropped no matter what (which is what Charette thought 
occurs in Paris). It may therefore be discarded. 

This is particularly clear when considering cases such as cafet-iére 
(schwa may be dropped) and hétel-iére (schwa must be realised) where the 
yod and the following vowel belong to a suffix (the morphological structure 
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of cimetière and atelier is less clear): the syllabic identity of the suffix-initial 
[DV] sequence must be the same in both types of words since the [jV] belongs 
to the same morpheme. If it identified as a complex nucleus, it should be able 
to govern the preceding schwa in both types of words — but this is not the case. 


Branching onset 


The branching onset analysis of [Cj] is impossible in Standard GP where resyl- 
labification is prohibited (Kaye et al. 1990, 221; Charette 1991, 222, note 1): 
whatever the lexical identity of the yod in the suffix -iére, it would need to 
become the second element of a branching onset, whose first element is the 
root-final consonant, a simple onset in the lexicon. In other words, a branching 
onset made of items from two different morphemes is impossible in Standard GP. 

In this context, let us consider the parallel between [oTRV] and [sCjV] 
that Charette (1991, 116ff.) makes explicit. In her 1991 analysis, the reason 
why in Quebec French schwa cannot drop in la s/a/crétaire ‘the secretary’ 
(aTRV) and at/ajlier (aCjV) is the same: its nucleus cannot be reached by 
government, which is blocked by the following branching onset (TR in the 
former, Cj in the latter case) (see 2a). 

In French, word-internal clusters of three consonants always involve 
a coda followed by a branching onset [C.TR] (perdre ‘to lose’, arbre ‘tree’, 
filtre ‘filter’, mercredi ‘Wednesday’, etc., Dell 1995, 10-17). Therefore, 
knowing that the yod is indeed a consonant (rather than part of a complex 
nucleus, see the previous section), the [Cj] in [C’Cj] (cim ’tiére) can only be a 
branching onset. That is, [C’Cj] from /CaCj/ and [C’TR] from /CaTR/ should 
instantiate the same syllabic pattern: a coda followed by a branching onset.* 

Given this situation, not pronouncing schwa in [CaCj] sequences 
should be just as fine as leaving it out in [CoTR] clusters: speakers who can 
pronounce [C’TR] should also be able to pronounce [C’Cj]. 


Empirical situation 


In her 1998/2003 articles, Charette mentions ten words that illustrate the [CaCj] 
pattern: four where schwa can be dropped ([mtj] cimetière, [ftj] cafetière, [nbj] 
canebiere, [ntj] canetiére) and six where it cannot. Among the latter, four have a 
bogus cluster [dl+j], [tl+j] (hôtelière, atelier, chandelier, dentelliére), one shows 
[fl+j] (bachelier) and one appears with [ml+)] (sommelier). Given her analysis, 
Charette must say that [fl] and [ml] are not possible coda-onset (and thus 
interonset) sequences. She does not discuss the issue, but supporting evidence is 
the fact that these clusters do not occur in intervocalic position in French.* 
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Looking at a broader record of words instantiating [CoCj], it appears that 
the CC cluster created by the putative loss of schwa does not bear any respon- 
sibility for the (non-)pronunciation of schwa. Rather, the pattern is governed by 
the consonant following schwa: if this consonant is the lateral [1], schwa cannot 
be left out. If this consonant is different from the lateral, schwa may be dropped. 

This is shown under (3), where [CaCj] clusters produced by the suffix 
-ierl-iére appear.’ The lexical record for [CaC]-ier/-iére should be near 
exhaustive and may contain a number of words that natives are unfamiliar 
with. In case speakers encounter a word that they do not know, they will treat 
it as a new lexical item, that is, as a nonce word, and that does not appear to 
alter the grammaticality judgements reported below. 


(3) CaC-ier/-iére 
a. ClaC,j where C, #1 


b s gibecière ‘gamebag’ n t lunetier ‘glasses maker’ 
f t buffetier ‘shopkeeper’ t panetier ‘bread keeper’ 
t cafetier ‘café tenant’ t panetière ‘bread cupboard’ 
t cafetière ‘coffee maker’ t robinetier ‘tap maker’ 
1 t giletier ‘waistcoat maker’ v chènevière ‘hemp field’ 
t giletière ‘watch chain’ p t papetier ‘paper maker’ 
t molletière ‘puttee’ r t charretier ‘carter’ 
t muletier ‘donkey keeper’ t ferretier ‘farrier hammer’ 
t pelletier ‘furrier’ t jarretière ‘garter’ 
t pelletiérine ‘type of dewormer’ f t guichetier ‘counter clerk’ 
t toletière ‘oar reenforcer’ t n centenier ‘centurion’ 
m t cimetière ‘cemetery’ v t buvetier ‘barkeeper’ 
n b canebière ‘hemp field’ t louvetier ‘wolf hunter’ 
t bonnetier ‘bonnet maker (masc. t savetier ‘cobbler’ 
t bonnetiére ‘bonnet maker (fem.)’ z n dizenier ‘commander of 10 men’ 
t canetiére ‘silk worker’ t gazetier ‘gazette owner’ 
t chainetier ‘chain maker’ t noisetier ‘hazel tree’ 
t grainetier ‘seed merchant’ s v sanseviere ‘sansevieria (bot.)’ 
b. ClaC,j where C,= 1 
d 1 chandelier ‘candle stick’ t 1 coutelier ‘cutler’ 
m | chamelier ‘camel caravan driver’ 1l hotelier ‘hotel keeper’ 
1 sommelier ‘wine waiter’ 1 râtelier ‘rack’ 
n l cannelier ‘cinnamon tree’ 1 dentellière ‘lacemaker’ 
1 tonnelier ‘cooper’ s l chancelier ‘chancellor’ 
1 tunnelier ‘tunnel borer’ 1 pincelier ‘brush box’ 
p l chapelier ‘hat-maker’ 1 vaisselier ‘dresser’ 
r l bourrelier ‘harness-maker’ 1 ficelier ‘trickery-using person’ 
J 1 bachelier ‘bachelor’ 1 boisselier ‘wooden objects merchant’ 
1 échelier ‘ladder’ 1 museliére ‘muzzle’ 
1 richelieu ‘type of shoe’ 1 oiselier ‘bird-seller’ 
t l atelier ‘workshop’ 1 roselière ‘reed bed’ 
1 batelier ‘ferryman’ 
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Grammaticality judgements by seven native speakers from mainland France 
that were born and brought up North of the Loire were collected. Only 
speakers were selected who could leave out schwa in CaTR /a secrétaire ‘the 
secretary’: this made sure that negative judgements of a particular item were 
not due to the impossibility for schwa to be left out before branching onsets 
(see section above, ‘Branching onset’: a number of speakers of Northern 
varieties cannot leave out schwa before TR). The speakers selected were 
asked to say for each word whether a pronunciation without schwa is possible, 
bizarre but possible, or impossible. 

Results are unambiguous: items under (3b) where yod is preceded by 
the lateral were overwhelmingly judged impossible by all speakers, while 
items under (3a) where a consonant different from the lateral precedes yod 
were overwhelmingly said to be possible. Details are as follows. 32 [CC+] 
words (3a) and 15 [Cl+j] words (3b) were tested ([CCC+j] and [Cs+j] 
were not included, the latter because of possible [s+C] effects). Participants 
had one vote for each word where schwa was left out, to be chosen among 
‘possible’, ‘bizarre but possible’, and ‘impossible’. With seven participants, 
for any given word each of the three possible votes scores between zero and 
seven. The average score of ‘possible’ for [CC+j] was 4.88, against 0.4 for 
[Cl+j]. Conversely, ‘impossible’ scored 0.94 with [CC+j], against 5.27 with 
[Cl+)] (scores for ‘bizarre’ were comparable: [CC+j] at 1.19, [Cl+j] at 1.33). 
The contrast between [CC+j] and [Cl+j] is thus very strong, in the direction 
predicted. 

The same contrast appears when looking at percentages: 69.6 per cent of 
responses for [CC+j] items were ‘possible’ (156 of 224), against only 5.7 per 
cent for [Cl+j] items (6 of 105). This difference is of course highly significant 
(y2 (1, N = 224) = 116,89, p<.00001). Conversely, 13.4 per cent of votes were 
on ‘impossible’ for [CC+j], against 75.2 per cent for [Cl+j]. The difference is 
also highly significant (x? (1, N = 224) = 123,42, p <.00001). 


The blame is on the lateral 


The testimony of words suffixed by -ier/-iére under (3) shows that it is not 
the cluster type preceding yod that governs schwa loss in [CaCj] (as held by 
Charette 1998/2003), but rather the [Cj] cluster following schwa (as argued by 
Charette 1991): schwa may be dropped when followed by any consonant plus 
yod, except if this consonant is the lateral. 

Note that Charette’s (1998, 173; 2003, 473ff.) generalisation according 
to which schwa can be dropped if the CC in [CaCj] makes a good coda-onset 
sequence is also contradicted by [rl] (bourrelier) and [sl] (vaisselier) where 
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schwa is mandatory, although the clusters at hand are good coda-onset 
sequences in French, as shown by merle ‘blackbird’, arlequin ‘harlequin’, 
(avoir la) berlue ‘being blind’, dirlo ‘director’, horloge ‘clock’, hurler ‘to 
yell’, etc., for [rl] and islam ‘Islam’, Islande ‘Iceland’, législatif ‘legislative’, 
etc., for [sl]. 

Given the parallel with [CaTR] where TR is a good branching onset (see 
section “Branching onset’), the generalisation suggests that yod can engage with 
any preceding consonant to form a branching onset, except with the lateral: 
speakers who can drop schwa in [CoTR] (/a s ’crétaire ‘the secretary’) can also 
leave it out in [C,9C,j], provided that C, is not the lateral (cim tière). In other 
words, any [Cj] sequence is a good branching onset, except [lj] which is not. 

Charette (1991) was thus right in identifying the syllabic status of [Cj] in 
[CaCj] as the driving force of schwa deletion. She did not identify the distri- 
butional pattern that decides whether schwa may be left out or not, though: 
schwa may be dropped when followed by any consonant plus yod, except 
if this consonant is the lateral. Has this generalisation gone unnoticed in the 
literature on [Cj] in French, then?° No: Dell (1973, 262) incidentally notes that 
he will not address the question why schwa cannot be deleted when followed 
by a liquid and yod (‘La seconde question que nous laisserons de côté est le 
maintien de schwa lorsqu’il est suivi d’une liquide et d’un yod’). As far as 
I can see, he is the only author who has understood that schwa deletion in 
[C,9C,j] is blocked when C, is the lateral. But since he says that he will not 
pursue this question, the pattern remains largely unillustrated in his book. It 
may thus be the case that the data in the section ‘Empirical situation’ are the 
first empirical demonstration of the generalisation at hand. 

Finally, note that Dell’s statement also includes [rj] clusters: he says that 
schwa cannot drop in [C,0C,j] if C, is a liquid, that is [r] or the lateral (note 
that [C,9C,]-ier/-iére under (3a) does not appear to produce any cases with 
C, = [r]). It takes some argument to see that Dell is right (a piece of evidence 
are conditionals like donn-er-i-ez ‘you (pl.) would give’ or huil-er-i-ez “you 
(pl.) would oil’ where schwa cannot be dropped: *donn’rjez, *huil’rjez), but 
[rj] clusters represent an intricate pattern in French that cannot be examined 
here for lack of space. 


Analysis based on branching onsets 


It was mentioned in section ‘Branching onset’ that Charette’s 1991 analysis 
where [Cj] in [CaCj] is a branching onset in Quebec French is impossible in 
Standard GP since this would require resyllabification, which is prohibited in 
this theory: in cafet-iére for example, the lexically simplex onset containing 
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the root-final consonant would have to become a complex onset in order to 
accommodate the yod coming from the suffix. 

As we have seen, though, the pronunciation of schwa in [CaCj] depends 
on whether the following [Cj] cluster does (all [Cj] except [lj]) or does not 
({lj]) qualify as a branching onset. In Strict CV (Lowenstamm 1996; Scheer 
2004), a branching onset identifies as a TR cluster belonging to two onsets 
whose solidarity is due to an infrasegmental relation (< in 4a) (Scheer 2004, 
§64; Brun-Trigaud and Scheer 2010). The empty nucleus enclosed does not 
require government because it is silenced by this infrasegmental relation. 
Being therefore ungoverned, it is itself a good governor and governs the 
preceding schwa. This is shown under (4a) for [tj] in guichet-ier where schwa 
may be left unpronounced. 


(4) CəCj in Strict CV 
a. [Cj] branching onset: schwa governed b. [lj] no branching onset: schwa ungoverned 


Gyt Gyt 


Cc VC V € V=CV GC VC VC Vs CV 

bo | to to | | bo ft J | I | 

g i fot €je o to J je 
guichet-ier [ftj] hotel-ier *[tlj] 


By contrast under (4b), government cannot reach schwa because [lj] is not 
a good branching onset: the nucleus enclosed needs to be governed and 
therefore cannot be a governor itself. Thus schwa is pronounced. 

Note that the issue of resyllabification does not arise in Strict CV 
since there are no branching constituents and there is no coda constituent 
(or post-rhymal complement): under (4a), the suffix-initial yod makes a 
branching onset with the root-final [t] by establishing an infrasegmental 
relation (€). 


Conclusion 


The [CoCj] pattern is but a piece of the [Cj] puzzle in French, and the 
preceding pages have studied only a facet of that piece. Some other aspects 
and pieces are the following. Why is it that diérése, that is, expanding [j] into 
[ij], allows schwa to remain unpronounced in conditionals (vous donn-er-i-ez 
[don’rije] “you (pl.) would give’), but not in [Cal]-ier (hôtel-ier *[ot’lije])? 
How come [Rlj] from /Rlj/ is possible (per/-ier [rlj] ‘pearl, adj.’, vous parl-iez 
‘you (pl.) talked’), but not when [RI] encloses a schwa /Ralj/ (*bourr lier 
“saddler’)? Why can schwa be absent in vous vous atteliez (att liez [tlj]) ‘you 
took care of’, but not in atelier (*at’Tier *[tlj]) ‘workshop’ (Klein 1992, 39)? 
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Why is [Tlj] well-formed when created by -i-ons/-i-ez (vous boucl-i-ez [klj] 
‘you (pl.) surround / bring to an end’), but impossible when originating in 
other suffixes (boucl-ier *[klj] ‘shield’) (Dell 1972, 73)? How come that 
-i-ons/-i-ez can occur without [i] after [Tl] (vous boucl-i-ez [klj] “you (pl.) 
surround / bring to an end’), but not after [Tr] (vous plåtr-i-ez [trij] *[trj] ‘you 
(pl.) plaster’) (Dell 1972, 73; Tranel 1987, 121; Klein 1992, 23ff.)? I pursue 
these issues elsewhere. 

And there is also something to be understood about the whole [Cj] 
pattern in Quebec French. Recall the parallel between [CaTR] and [CaCj] 
(‘Branching onset’ section): it holds indeed for the variety described in the 
article (unmarked Northern mainland or Parisian speakers), but Quebec 
French is different. Here the branching onset [Cj] (cim tière) allows for the 
preceding schwa to be dropped, but regular branching onsets TR (obstruent- 
liquid) do not: la s’crétaire ‘the secretary’ and le s’cret ‘the secret’ are 
possible in Paris, but not in Quebec French (Charette 1991, 102-4). So what 
does that mean? The existence of two different types of branching onsets in 
Quebec French, or in any other language for that matter, would be unsettling. 


Notes 


1. Most French words that appear in this chapter are listed in the section ‘Empirical situation’ where 
glosses are provided. Elsewhere, glosses only appear for words that are not mentioned in this section. 

2. In fact, following Haworth (1994), [Cj] in Charette’s analysis is a C followed by an I element that 
branches on both the following nucleus and the following onset: [Cj] is realised when the /i/ of /CijV/ 
is governed by the following V. This is to account for the fact that based on an invariant underlying /ij/, 
the i is unpronounced (governed) when preceded by a single consonant as in abricot-ier [absikotje] 
‘apricot tree’, but realised when preceded by a TR cluster that requires the presence of a vowel to its 
right, as in sucr-ier [sykrije] ‘sugar bowl’. 

3. This description also holds for Charette’s 1998/2003 analysis where, recall from the opening of 
this chapter, the CC cluster of [CCj] (cim tiére) identifies as two onsets, which however have the 
exact same distribution as coda-onset clusters. On this take, if the C’C from /CaCj/ in cimetiére-type 
words can form an onset-onset governing domain, the same should be true for C’C from /CaTR/ in la 
secrétaire ‘the secretary’. 

4. Except when resulting from schwa deletion as in éch ‘ler ‘to climb on a ladder’ for [f1] and gromm ‘ler 
‘to mutter’, jum’ler ‘to twin’ for [ml]. But that does not count since the members of the cluster are 
lexically separated by a schwa. 

5. In French, [Cj] clusters are produced by a number of word types, which fall into cases where the 
yod belongs to the root and others where it originates in a suffix. In the former category, items occur 
whose [Cj] i) may (/i-er ‘to join’, ni-er ‘to deny’) or ii) may not (copi-er ‘to copy’, unifi-er ‘to unify’) 
be word-initial. The latter group is made of three suffixes: iii) -ier -ière (abricot-ier ‘apricot tree’, 
chemis-ier ‘blouse’), iv) -ien (alsac-ien ‘Alsatian’, brésil-ien ‘Brazilian’) and v) 1pl. -i-ons, 2pl. -i-ez 
(imperfect nous batt-i-ons ‘we beat’, subjunctive que nous batt-i-ons ‘that we beat’, conditional nous 
batt-er-i-ons ‘we would beat’). 

Group i) of course cannot produce [CaCj] items, which also appear to be absent in groups ii) and iv). 
Group v) does create [CoCj], but only with [rj] (C’rj). This is because there are no roots ending in [CaC] 
to which imperfective/subjunctive -i-ons, -i-ez could be added, and conditional -i-ons, -i-ez only attach 
to infinitives, which always end in -r. When belonging to the first group of verbs in -er, the -e- appears 
as schwa in the conditional, hence producing [Carj] as in vous aimar-i-ez (aimer ‘to love’). In this case, 
schwa may be dropped if the following yod expands into [rij] (a pattern called diérèse: ram ‘rijez (ramer 
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‘to row’), fil 'rijez (filer ‘to push off’), donn ’rijez (donner ‘to give’), but must be pronounced if followed 
by rj (*ram ‘rjez, *fil’rjez, *donn 'rjez) (Tranel 1981, 285ff., Klein 1992, 23, Dell 1973, 263). 
In sum, thus, only iii) -ier /-iére produces [CaCj] sequences. 
6. Relevant work includes Dell (1972, 1973), Morin (1979 [1971]), Lyche (1979), Tranel (1981, 64-6, 
1987, 115-21), Kaye and Lowenstamm (1984), Klein (1992), Rizzolo (1999), Durand and Lyche 
(1999), and Côté (2018). 
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The prince and the nymph: 
interconsonantal plosive-zero 
alternation in English 


Péter Szigetvari 


Introduction 


Some cluster types of English are variably pronounced with or without a plosive 
in the middle.'! Examples include prince [prin(t)s],? nymph [nim(p)f], month 
{mon(t)0], vax [lin(k)s], strength [stren(k)®], prompt [prom(p)t], instinct 
[instin(k)t], etc. The status of these plosives is not clear: are they ‘underlying’ 
or ‘lexical’, or are they ‘superficial’, ‘emergent’, ‘ephemeral’, or ‘phonetic’ 
phenomena? Or perhaps some are ‘underlying’, others ‘superficial’? Are they 
(independent) segments at all? Or maybe they form ‘affricates’ together with 
the following fricative, at least where such a fricative is available? This is what 
the present chapter concerns itself with, but let me admit right away that no 
definite answers will emerge. Our first question has to be much simpler: what 
is the environment where these plosives alternate? Descriptions, dictionaries, 
and native speakers do not fully agree on the data. 

The title of this chapter tries to be neutral: I am not talking about 
inserted or deleted plosives, but about plosives that alternate with zero 
in interconsonantal position. Dictionaries do not indicate these plosives 
uniformly. The online Merriam-Webster dictionary (henceforth MW)? applies 
the most obvious method: parenthesising the plosive, as I have above. The 
Longman Pronunciation Dictionary (Wells 2008, henceforth LPD) distin- 
guishes between ‘insertable’ plosives which are marked by a raised symbol 
(for example, prince [prin's]) and ‘deletable’ ones which are set in italics 
(for example, chintz [tfints]). Fourakis and Port (1986) have found a slight 
durational difference between the two types. But even if this difference is 
real, native speakers may make it on orthographical grounds: the [t] is not 
indicated in the spelling of prince, but it is in the spelling of chintz. In any 
case, LPD seems to use a raised symbol for the plosive in exactly those 
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words where it is not shown in the spelling and an italic one in those where 
it is shown, suggesting that the former is superficially inserted, the latter is 
underlying and deleted (some further examples: Simpson [simpson] versus 
Simson [sim?son]; century [séntfarij] versus censure [sén‘fa], etc.). What 
raises some doubt about separating the two types is that the variation occurs in 
both directions: a [t] may be inserted between [n] and [s], and the [t] may be 
deleted between [n] and [s]. This makes sense if one either (i) denies that the 
output of this process is a pattern that is phonotactically more desirable than 
the input (since either [nts] or [ns] is ‘better’ phonotactically than the other, 
but not both) or (ii) claims that ‘insertion’ and ‘deletion’ are two names for the 
same process here, ‘underlyingly’ either both prince and chintz contain [t] or 
neither does so. In case (ii), the only difference between these two words is in 
their prevocalic part, their postvocalic portion is identical, so the two words 
are either [prins] and [tfins], or [prints] and [tfints]. 


The phenomenon 


Some of the existing descriptions of this phenomenon are rather anecdotal 
and often mention it merely as an argument for some hypothesis of phono- 
logical representation (Clements 1987; Harris 1994; Kiss 2002). Therefore 
we must first identify the environments where the alternation takes place. In 
doing so, I will limit the phenomenon to alternations that occur between two 
consonants that are within a morpheme. Variable plosives occur in word-final 
position in compounds or before a suffix, as in breaststroke [bres(t)#strawk], 
dumped [dam(p)#t] or thinks [0in(k)#s]. This kind of variation, however, 
does not directly concern us here, since these plosives are variable only when 
followed by an obstruent, but, in any case, a consonant in the next morph, 
without which they are stable: the emboldened plosive is obligatory in 
breastwork, jumping, or thinking. In other words, the plosive is undoubtedly 
not inserted, but deleted in these cases. I also ignore the type of variation 
witnessed in often [of(t)an], since it does not occur between (nonsyllabic) 
consonants and it is lexical, that is, unpredictable (cf. chieftain [tfijfton] with 
no alternation). 


Neighbouring segments 


A variable plosive occurs much more commonly before a fortis fricative or 
plosive, than before a lenis one as shown in (1). Note that [6] does not occur 
after a consonant within a morpheme at all, but even if it did, the prediction is 
that no plosive could turn up before it. We do find a plosive before [z] in (1b) 
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and (1d) and [3] in (1c), more on which below. Grammaticality judgements 
are those of LPD, I do not distinguish the ‘insertable’ and the ‘deletable’ 
plosives. 


(1) Alternating plosives before fortis and lenis fricatives 


fortis lenis 
a. triumphal [trajam(p)fol] triumvir [trajamvo] 
warmth [wo:m(p)@] — 
Gimson [gim(p)son] crimson [krimzon] 
b. infant [in(t)fant] inventory [invontrij] 
anthem [án(t)ðəm] — 
cancel [kan(t)sal] Bermondsey [bő:mən(d)zij] 
c. mensural [mén(t)foral] lingerie [lan(d)3orij] 
d. length [lén(k)@] (also [lén(t)6]) — 
angst [an(k)st] anxiety [an(g)zajatij] 
anxious [ay(k)fos] = 
e. empty [ém(p)tij] lambda [lámdə] 
f. extinct [ikstin(k)t] kingdom [kindom]* 


The data in (1) are representative of the consonants between which the 
alternation of plosives takes place. I only include clusters beginning with a 
nasal, although in an earlier discussion of this phenomenon, Clements (1987) 
also mentions [l]. However, LPD does not indicate this plosive in any word 
containing an [l]+fortis fricative cluster, and MW only have it in some of 
the candidates: else, waltz, health, and Welsh (note the alternative spelling 
Welch), but not in other similar words like pulse, salsa, ulcer, filth, stealth, etc. 
The apparent rarity of the alternation between [I] and a fortis fricative may be 
due to the spread of L-vocalisation: the words affected contain [w] instead of 
[1] for an increasing number of speakers. 

We find very few environments where a lenis plosive alternates: [n_z] 
(as in Bermondsey, in (1b)), [n_3] (as in lingerie, in (1c)), and [n_z] (as in 
anxiety, in (1d)). Although my aim is to not distinguish between cases where 
the plosive is ‘underlying’ and others where it is not, it is noteworthy that in 
all three words, as well as in others like them (Windsor [win(d)za], Lindsey 
[lin(d)zij], plunge [plon(d)3], orange [orin(d)3], etc.) the plosive is indicated 
in the spelling. In fact, the alternation of [nz] and [ndz] may be analysed as 
one of the affricate [dz] with the fricative [3], rather than that of the plosive 
[d] and zero. For example, MW, which does not use standard IPA transcrip- 
tions, has [-ja-] (=[-d3a-]) versus [-zho-] (=[-39-]), and only in /ingerie, not 
it plunge or orange. The [d3]~[3] alternation is a feature of recent French 
loanwords, irrespective of the preceding nasal (cf. garage [gara:3]~[garid3], 
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genre [3z0nra|~[dzonra]). Neither the LPD, nor MW marks an optional 
plosive in words where the spelling shows [nz]® (like bonanza, lens, stanza). 
Nevertheless, citing Fourakis and Port (1986), Clements does mention the 
possibility of a [d] before the [z] in words like Jens, dens, dells (1987: 34). 
The alternation of a lenis plosive is vanishingly rare after a noncoronal nasal, 
even in cases where the orthography indicates one. The labial [b] never 
alternates: ambsace [éjmzejs], lambda [lamda]; but we do find an exception in 
LPD for the velar [g]: anxiety [an(g)zajotij], the plosiveful version labelled as 
‘non-RP’. In what follows, I ignore the possible alternation after [1] or before 
a lenis obstruent, and concentrate only on that which occurs after a nasal and 
before a fortis obstruent.’ 

As already stated, in the vast majority of cases the variable plosive 
occurs before a fortis obstruent. Its place of articulation is that of the preceding 
nasal as shown in (la) and (le) for labials, (1b—d) for coronals and (1d) and 
(1f) for velars, the obstruent that follows does not influence its place. So after 
[m] the alternating plosive is uniformly [p], after [n] it is [t], after [1] it is [k], 
irrespective of whether the following obstruent is [f], [0], [s], [f], as in (la—d) 
or [t], as in (le—f). This means that the plosive appearing between the nasal 
and the following fricative shares its place of articulation and its noncon- 
tinuancy with the nasal, and its fortisness (and obstruency) with the following 
obstruent. I will return to the preplosive alternation in below. 


Stress 


All the words in (1) contain a stressed vowel right before the nasal+fricative 
cluster in which the alternating plosive is located. Yet it is not the status of the 
preceding vowel that matters, but that of the following one. The alternation 
equally occurs after an unstressed vowel, as long as the alternation site is not 
followed by a stressed vowel: triumph [trajom(p)f], entrance [éntron(t)s], 
larynx [larin(k)s].° The data in (2), taken from dictionaries, show that there is 
no alternation before a stressed vowel. Here I mark not only main or ‘primary’ 
stress (the tonic), but also the full vowel following, which, as these data show, 
also counts as stressed. 


(2) Alternating plosives and stress 


not before stressed vowel before stressed vowel 
symphony [sim(p)fonij] symphonic [simfonik] 
nymph [nim(p)f], nymphet [nim(p)fot] nymphet [nimfet] 

Gimson [gim(p)son]* Gimsonian [gimsawnijon]* 
plimsoll [plim(p)sol] plimsoll [plimsaw1] 
Sam(p)son [sam(p)son] Samsung [samson ]* 
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infant [in(t)font] confess [konfés] 


synthesis [sin(t)@asis] synthetic [sinOetik] 
prince [prin(t)s] princess [prinsés] 
insolent [in(t)solont] insect [insekt] 
sensual [sén(t)f(u)wal] ensure [infu:] 
conscious [kon(t)fas] Kinshasa [kinfa:so] 


Jinx [d3in(k)s] = 


All the data in (2) are agreed on by LPD and MW, except for the words 
marked ‘*’, which are LPD-only pronunciations, missing from MW. The 
word princess appears as [prin(t)sos] in MW, with the alternating plosive, 
before an unstressed vowel. MW adds the variant in (2), [prinsés], without a 
plosive before the stressed vowel, with the label ‘usually British’. However, 
the two dictionaries differ in their judgements of the variant [prinsés]: MW 
does not indicate a [t] in it, as we would expect based on, for example, insect, 
which has the same stress pattern. Somewhat surprisingly, LPD does indicate 
a plosive here ([prin(t)sés]), at least for British English. In its American 
variants, LPD conforms to the pattern in (2): [prin(t)sos], [prinsés]. It is not 
clear to me if this is a real difference between the two accents, or simply a 
typographical error. 

The cluster [nks] in jinx and other words of its kind corroborates the 
hypothesis that the distribution of the alternating plosive is related to stress. 
While [nks] occurs word finally (for example, Bronx, lynx, sphinx) and, 
rarely, before an unstressed vowel (for example, Banksy, pinxit), we do 
not find this cluster in any word before a stressed vowel. Etymologically, 
the [k] of an [nks] cluster cannot be epenthetic: [n] is a recent development 
in English, which did not occur before [s], except where a velar plosive, 
alternating in present-day English, intervened. So it is not only that a plosive 
is not inserted in a hypothetical [ns] cluster before stress, but the plosiveful 
[nks] cluster does not exist in this environment either.? We find a similar 
situation with [mps]: it occurs word finally (for example, glimpse, mumps) 
and before an unstressed vowel (mostly in names, for example, Dempsey, 
Empson). I have found a single example with [mps] before a stressed vowel, 
palimpsést, which is a learned word, and possibly the pronunciation is 
influenced by the spelling.!° 

However, in addition to princess, there are further words in LPD 
that are transcribed with an unexpected plosive, like menthol [mén(t)001] 
or samphire [sam(p)fajo]. MW does not indicate a plosive in either of 
these words. To try to figure out what’s going on, I decided to ask native 
speakers. 
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A survey 


I conducted a simple survey on alternating plosives. To do this, I set up a 
website at http://seas.elte.hu/epenthesis, which offered a randomly ordered 
list containing 50 words, each containing a nasal+fortis fricative cluster. 
Each word was given in standard spelling and a transcription in which a 
plosive homorganic with the nasal was printed in red between the nasal and 
the fricative. The participants were mostly phonologists and other linguists, 
recruited through the MFM list (the mailing list of the Manchester Phonology 
Meeting) and the Linguist List. They were asked to mark each item with one 
of three options, ‘yes’ if they thought the word was possible with the plosive, 
‘no’ if they thought it was not, and ‘dunno’ if they could not decide. I recorded 
the answers from 8 November to 7 December 2019. Statistics about the 
participants are provided in (3). 


(3) The survey participants 


¢ Number of participants: 191, of which self-declared native 
speakers: 182. 

¢ Age ranges: under 30: 33; 30-50: 91; 50-70: 44; over 70: 14; 
did not say: 9. 

e Came via: MFM list: 137; Linguist List: 38; friend: 6; other: 10. 


I did not expect the judgements to be influenced by the informant’s dialect, 
so I did not ask for this bit of information. The differences between LPD 
and MW in princess, ménthol, or samphire suggest that this may have been 
a mistake. 

It is obviously not easy to decide by introspection whether a plosive 
is possible or not in a given word. The results of the survey show not what 
the participants produce, but what they believe they produce, or even 
worse, what they believe they should be producing. Even with these restric- 
tions in mind, the results are surprisingly uncategorical. For rarer words, 
over 40 per cent of participants gave a ‘dunno’ answer. Many participants 
shared their concerns and dilemmas after giving their judgements. When 
preparing the survey, I carelessly labelled the plosives as epenthetic, 
which resulted in some concerned comments, especially in the case of 
anxious and palimpsest. Even though the plosive is claimed by many 
to be ‘underlying’ in these words (obviously a claim based on spelling/ 
etymology), 8 and 17 participants, respectively, deemed it impossible to 
have a [k] and [p] in them. 
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The results 


The results are ordered by the ratio of yes and no answers in the first column 
of Table 20.1. If this ratio is greater than 1, there were more participants who 
said they could imagine the plosive in that word than who said they could not; 
if it is less than 1, there were fewer such participants. Thus we have a ranking 
list of words (environments) from most to least favourable for containing 
a plosive between the nasal and the fortis fricative. In the last two columns 
I mark the view of LPD and MW on these plosives. The two dictionaries 
have different strategies. LPD never marks an obligatory plosive between a 
nasal and a fricative; these plosives are either in italics (when present in the 
spelling) or in a raised index (when absent from the spelling).'! The first case 
I mark with a tick (v), the second with a percent sign (%). I use the cross (X) 
to mark words where no plosive is indicated at all. MW only marks one type of 
alternation (here labelled %), in this column the tick marks a plosive which is 
obligatorily present according to MW (only in avalanche). Words with a dash 
are not listed in the relevant dictionary. 

The words in Table 20.1 were not meant to be representative of words 
containing nasal+fortis fricative clusters; instead I aimed at including an 
example for each possible segment combination both with and without 
a stressed vowel following. No neat cut-off point that separates words in 
which the plosive is acceptable and unacceptable emerged from the answers. 
Nevertheless, words with plosive alternation in LPD and MW tend to appear 
at the top of Table 20.1, and words without at the bottom. 

The name Banff contains a [t] in LPD ([ban'f]) with potential labial 
assimilation ([bamPf]), while MW has assimilation variant only; this is marked 
by an asterisk. Several participants mentioned this assimilation in other [nf] 
clusters, while LPD does not indicate any such possibility. !? 


Some discussion 


The two words which were mostly deemed acceptable with the plosive are 
those whose spelling suggests that the plosive is lexical (‘underlying’): 
anxious and palimpsest. The words least likely to have an alternating plosive 
contain [nf]. While it is true that this is a nonhomorganic cluster, words 
containing another nonhomorganic cluster, [9], came out at the top of the list. 
Homorganicity apparently is not (always?) a relevant factor in the likelihood 
of plosive alternation. 

Stress, on the other hand, appears to be relevant, as the dictionary 
data collected in (2) show. However, as it has been noted MW does not, but 
LPD does have some examples that have a plosive before a stressed vowel, 
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Table 20.1 The results of the survey. 


Yes/No Yes Dunno No Word LPD MW 

23.750 181 (95%) 2(1%) 8 (4%) anxious an(k)fas v % 
8.111 140 (73%) 34 (18%) 17 (9%) palimpsest palim(p)sest v % 
6.692 166(87%) 1(1%) 24 (13%) length lén(k)@ % % 
4.824 156(82%) 3(2%) 32 (17%) lengthen lén(k)8on % % 
4.222 145(76%) 11 (6%) 35 (18%) prince prin(t)s % % 
3.850 147 (77%) 5 (3%) 39 (20%) avalanche avola:n(t)f v v 
3.524 142 (74%) 9(5%) 40 (21%) nymph nim(p)f % % 
3.409 143(75%) 6 (3%) 42 (22%) warmth wo:m(p)0 % % 
2.593 133 (70%) 6(3%) 52 (27%) ancient éjn(t) font % % 
1.879 119 (62%) 9 (5%) 63 (33%) Samsonite sam(p)sonajt % — 
1.649 116(61%) 4(2%) 71 (37%) emphasis ém(p)fosis % % 
1.639 113 (59%) 9 (5%) 69 (36%) sensual sén(t)fwal % % 
1.325 101 (53%) 14(7%) 76 (40%) nympho nim(p)faw % % 
1.209 100(52%) 9 (5%) 82 (43%) comfort kdm(p)fat % % 
1.133 65 (34%) 68 (36%) 58 (30%) samphire sam(p)fajo % x 
1.067 91 (48%) 14(7%) 86 (45%) censor sén(t)so % % 
1.051 78 (41%) 38(20%) 75 (39%) penchant pon(t)fon % % 
0.979 88 (46%) 14 (7%) 89 (47%) symphonic sim(p)fonik x x 
0.827 83(43%) 9 (5%) 99 (52%) anthem án(t)ðəm % % 
0.788 78 (41%) 14 (7%) 99 (52%) philanthropist filán(t)Ərəpist % % 
0.731 72 (38%) 20(10%) 99(52%) mentholated mén(t)ðəlejtid % % 
0.696 75(39%) 9(5%) 107(56%) month mén(t)0 % % 
0.659  52(27%) 60(31%)  79(41%) Plimsoll plim(p)sal % % 
0.636 54(28%) 53(28%)  84(44%) Plimsoll plim(p)sowl x x 
0.621 69(36%) 11(6%) 111 (58%) menthol mén(t)ðol % x 
0.565 50 (26%) 53 (28%)  88(46%) amaranthine ámərán(t)ðajn % x 
0.465 38(20%) 70(37%) 83 (43%) Gramsci gram(p)fij — — 
0.436 33 (17%) 83(43%) 75 (39%) Gomshall gom(p)fal x — 
0.406 54(28%) 5(3%) 132(69%) amphibian am(p)fibijon x x 
0.396 36(19%) 63(33%) 92 (48%) benthos bén(t)ðos % x 
0.377 49(26%) 10(5%) 132 (69%) Samsung sam(p)sun x = 
0.371  50(26%) 7(4%) 134(70%) anthology an(t)0dlod3ij x x 
0.322 37(19%) 42(22%) 112 (59%) Honshu hon(t)fuw x x 
0.296 41(21%) 14(7%)  136(71%) synthetic sin(t)Oétik x x 
0.283  24(13%) 79(41%) 88 (46%) Jamshid dzam(p)fijd x x 
0.257 = 37(19%) 13(7%) 141 (74%) philanthropic _ filon(t)®ropik x x 
0.246 30(16%) 37(19%) 124(65%) Kinshasa kin(t)fa:so x x 
0.234 35(18%) 9(5%) 147(77%) conceive kon(t)sijv x x 
0.231 34(18%) 8(4%) 149(78%) insect in(t)sekt x x 
0.228 34(18%) 7(4%) 150(79%) circumcise so:kom(p)sajz x x 
0.212 33(17%) 5(3%) 153 (80%) infant in(t)font % x 
0.177 27(14%) 13(7%)  151(79%) circumcision sd:kom(p)sizan X x 
0.159 25(13%) 9(5%) 157 (82%) Williamson wiljom(p)son x x 
0.146 22(12%) 12(6%)  157(82%) insure in(t)fo: x x 
0.135  20(10%) 29(15%) 142(74%) Banff ban(t)f % * 
0.103 18 (9%) 6(3%)  167(87%) himself him(p)sélf x x 
0.091 15 (8%) 8(4%) 168 (88%) conference kón(t)frəns x x 


(Table continued overleaf) 
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Table 20.1 (continued) 


Yes/No Yes Dunno No Word LPD MW 
0.049 8 (4%) 26(14%) 157 (82%) infarct in(t)fa:kt x x 
0.036 5 (3%) 25(13%) 161 (84%) confab kon(t)fab x x 
0.031 5(3%) 2(1%)  184(96%) confess kon(t)fés x x 


though this is never main (‘primary’) stress: menthol [mén(t)00l], samphire 
[sam(p)faja]. The survey partly supports the role of stress: the words at 
the top of the scale do not have stress after the alternation site, those at the 
bottom do. Nevertheless, native speakers also provide us with some surprising 
judgements, for example, the relatively high ranking of a plosive before the 
main stress in symphonic [sim(p)fonik]. Although MW (though not LPD) sees 
a clear difference between menthol (only [-n0-]) and mentholated [-n(t)8-], the 
participants of this survey do not: menthol, with 69 yes responses, fared only 
slightly worse than mentholated, which had 72. The categorical difference 
in both LPD and MW between the two variants of Plimsoll [plim(p)sal] and 
[plimsawl] is also nonexistent according to the current survey. The absence of 
conference *[kóntfrəns], however, is corroborated; more than twice as many 
participants ruled out this form than the plosive in a similar position in infant 
[intfont] (also cf. LPD’s infra [in(t)fro]). 

It can generally be stated that native speakers accept the alternating 
plosive less readily than dictionaries. The yes/no ratio sinks under | well 
before the possibility of alternation disappears from LPD or MW. 


Theoretical considerations 


Although there is no categorical distinction between the possible appearance 
of a fortis plosive between a nasal and a fortis fricative, stress does seem to 
play a role: the plosive occurs significantly more rarely when a stressed vowel 
follows the cluster than otherwise, that is, when an unstressed vowel follows or 
at the end of a word. This state of affairs is odd. It is a well-known fact about 
natural languages that more contrasts are available before a vowel than at the 
end of a word (for example, Steriade 1999; Ségéral and Scheer 2001), and 
more contrasts are available before a stressed vowel than before an unstressed 
one (for example, Harris 1997; Cyran 2010). Possibly for the same reason, 
more complex clusters are available before a stressed vowel (cf. Government 
Licensing, Charette 1991, 1992) than otherwise. If the [nts] in concert [kontsot] 
is a three-member consonant cluster and this three-member cluster cannot 
occur if the following vowel is stressed, as in concept [konsépt], then this does 
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not comply with the observations made by Charette: the second, unstressed 
vowel of concert licenses a more complex cluster than the second, stressed 
vowel of concept. An additional problem with the three-consonant-cluster 
analyses is theory internal to Government Phonology and its forks: with the 
restricted possibilities of building syllables, a nasal+plosive+fricative cluster 
can hardly be analysed. The nasal+plosive part cannot be parsed as a coda in 
this framework, and the plosive+fricative part cannot be parsed as an onset. 
The third option, an empty nucleus somewhere in the cluster is also not likely in 
this situation, since this empty nucleus would not be present in the plosiveless 
alternant. Inserting a ‘syllable’, that is, an onset and the following (empty) 
nucleus in the representation is a highly dispreferred scenario in this theory. 

Treating the [ts] in concert as an affricate seems to solve the syllabifica- 
tion problem of Government Phonology, but it is not an acceptable solution 
for other reasons. In this case the number of ‘segments’ does not increase 
in the change from [ns] to [nts], so government licensing is not affected 
and parsing is possible with the nasal being a coda, the affricate an onset. 
Nevertheless, a fricative turning into an affricate is fortition, and we do not 
expect fortition to occur word finally and before an unstressed vowel to the 
exclusion of a stressed vowel. Furthermore, while [ts] or [tf] could be analysed 
as affricates, most other clusters resulting in this process, [ps], [pf], [p0], [tf], 
[ks], [kf], [k0], can hardly be. 

A nasal+fricative cluster, like [ns], is more marked (that is, less 
common cross-linguistically) than a nasal+plosive or nasal+affricate cluster, 
like [nt] or [nts] (where, despite the current practice of the IPA, I use a ligature 
symbol for the ‘affricate’ to distinguish it from a three-member [nts] cluster). 
The questions are (i) whether the three-member cluster [nts] could be analysed 
as less marked than [ns], and (ii) whether the three-member [nts] cluster can 
be seen as distinct from a two-member [nts] cluster in any sense. I am not sure 
what the right answers to these questions are. 


Nasal+plosive clusters 


I conclude this chapter with a phonotactic consideration of the status of the 
alternating plosives. Such a plosive is found not only before a fricative, 
but also before another plosive or affricate. In this case the nasal is always 
noncoronal, [m] or [p], while the plosive or affricate is coronal, [t] or [tf]. The 
plosive alternating between a nasal and another plosive seems to be insensitive 
to stress. Such a plosive is indicated by both LPD and MW before a stressed 
vowel, even with the main stress. This alternation, however, never occurs 
before a lenis plosive; what is more, an earlier plosive is lost before a lenis 
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plosive: Camden < camp+den versus Hampton < ham-+tiin. Further examples 
are listed below, with a labial nasal in (4a), a velar one in (4b). 


(4) Alternation before a plosive 
not before stressed vowel before stressed vowel 


a. prompt [prom(p)t] Comptometer [kom(p)témita] 
dreamt [drem(p)t]'? asymptote [asim(p)tawt] 
empty [ém(p)tij] asymptotic [asim(p)totik] 


Sumter [sdm(p)te] 
sumptuous [som(p)tfas] 


temptress [tém(p)tras] temptation [tém(p)téjfan] 
b. succinct [soksin(k)t] cunctation [kan(k)téjfen] 
Langton [lan(k)ten] punctilio [pan(k)tilijow] 
sanctum [say(k)tom] sanctorum [say(k)td:rom] 
juncture [d3an(k)tfa] tinctorial [tin(k)to:rijal] 


The pretonic alternation of a plosive appears to be more common within a 
nasal+plosive than within a nasal+fricative cluster. It is noteworthy that in 
this case, the three-member cluster with the plosive in the middle comprises 
two subclusters, both of which are less marked (more common) than the 
two-member cluster without the plosive in the middle. Thus, while the nonho- 
morganic nasal+plosive [mt] or [nt] clusters are not commonly encountered 
in English, all the more so since they are optionally split by another plosive, 
the clusters resulting from the presence of this plosive, [mp] and [pt], [nk] and 
[kt], are common. This is why the nasal in this alternation is always noncoronal 
and the stable plosive coronal. The appearance of a plosive in a homorganic 
nasal+plosive cluster would result in a geminate, which is ruled out in English 
(*[mpp], *[ntt], *[ntt{], *[nkk]), while the appearance of a coronal plosive in 
a coronal nasal+noncoronal plosive cluster would yield the unattested [tp] 
([ntp]) or the rare [tk] ([ntk]) subcluster. Note that the rare occurrences of [np] 
and those of [nk] are predominantly before a stressed vowel (for example, 
Kanpur [kánpó:], incline [inklajn]). 


Conclusion 


English words containing a nasal followed by an obstruent often contain a 
plosive between the two consonants. This alternation occurs predominantly 
before a fortis, rather than a lenis obstruent. In the case of fricatives, the 
alternation is much more common before an unstressed vowel or word finally 
than before a stressed vowel. The alternation before a plosive, on the other 
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hand, is not sensitive to stress, but it exclusively occurs before a fortis plosive, 


never before a lenis one. A theory explaining these facts is badly needed. 
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I use the simple symbols [i], [e], [a], [9], [0], and [u] to represent the vowels of English. In the 
present chapter nothing hinges on this choice; my symbols can be replaced by more serious-looking 
ones of the reader’s preference (for example, [1], [e], [æ], [4], [o0]/[0], or [e]/[e]/[v], respectively). 

https://merriam-webster.com (accessed January 2020). 

There is no truly monomorphemic example for [nd]. 

For waltz LPD has [wols], [wo:ls], [wolts], [wo:lts]; curiously not the usual [wol's] or [wolts]. For 
Helmholtz, schmaltz, Schultz there is no [t]-less form in either the LPD or MW. LPD also contains 
Torvalds [to:valts], [t6:valdz]. Here the lenis, but not the fortis plosive is marked as variable, which 
constitutes a unique case. 

Note that [n3] does not have a standard orthographical representation in English. The letter g or j is 
[3] in some recent French loans (like /ingerie or jupe), but then it is also often [d3] in the same words. 
Other instances of [3] occur only intervocalically, as a result of the palatalisation of [z], spelled s or z 
(pleasure [plé3a], azure [a39]). 

A plosive (this time always lenis) also appeared earlier between a nasal and another sonorant: OE 
*crymlan > ME *crymblen ‘crumble’, OE cynréd > ModE kindred. This process is not active any 
more, we do not find any alternation in this environment in either direction: omelette [6m(*b)lat], 
general [dzén(*d)ral], assembly [asém*(b)lij], hundred [han*(d)rad]. 

The second vowel of triumph and larynx is probably not unstressed. We do not normally find an 
unstressed vowel before word-final [mp], [mf], [pk] clusters. Also note that LPD, which uses different 
symbols for stressed and unstressed schwa, a and ə, respectively, transcribes triumph as ['trat amPf], 
though it cannot indicate the difference in larynx, because, somewhat inconsistently, it uses the same 
1 for both the stressed and the unstressed vowel in this case. See Szigetvari (2017, 270f.) for further 
discussion. 

Apart from a variant pronunciation of anxiety, [pgz] does not exist either. 

The word metempsychosis is [métempsajkéwsis] in LPD. This may also be a spelling pronunciation. 
In MW we find the expected contrast: [matém(p)sikawsis]~[métemsajkdwsis]. 

Thus we find an italic plosive in glimpse [glimps], although etymologically this [p] is epenthetic; the 
word was earlier spelled glimse. 

The contrast of [nf] and [mf] is also brought up in Modern Family S02E10: at 4:06 Phil Dunphy is 
at great pains explaining the pronunciation of his name to his father-in-law. This issue reoccurs on 
several occasions in the series. 

Although dreamt consists of two morphemes, it is different from dumped [dam(p)t], since the [p] is 
obligatory in dump, but missing in dream. 
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Prosodic structure and recursion: 
a brief introduction 


Connor Youngberg and Florian Breit 


Recursion in syntax and phonology 


While recursion is a well-accepted feature of language within generative syntax, 
within phonology its existence and, if it exists, the role it might play continue 
to be keenly debated (see, for example, Vogel 2012; Martinez-Paricio and 
Kager 2015; Nasukawa 2020 and contributions therein). An early example from 
Government Phonology (GP) is the ‘matrix calculus’ of Kaye et al. (1985), who 
proposed that elements such as |A, I, U| undergo post-phonological spell-out 
into phonetic feature matrices of the type familiar from SPE (Chomsky and 
Halle 1968). To achieve this, they proposed what is essentially an asymmetric 
recursive algorithm which fuses feature matrices representing each element’s 
phonetic substance, selectively retaining feature values from head or operator. 
The approach faced significant challenges (Coleman 1990a, 1990b; Kaye 1990) 
and has given way to a more atomistic view in modern Element Theory (ET), 
which maps elements to patterns in the acoustic signal (Lindsey and Harris 1990; 
Harris and Lindsey 1993). For all intents and purposes, following work in GP has 
adhered to the view that phonology is essentially recursion-free. 


Recursive approaches to prosodic structure 


More recently, there have been a number of proposals which submit that both 
melodic and prosodic representations involve recursion to varying degrees. 
Likely the three most prominent proponents of this view in the GP/ET- 
tradition are GP 2.0 (Péchtrager 2006; Zivanovié and Pochtrager 2010), 
Radical CV Phonology (RCVP; van der Hulst 2020), and Precedence-free 
Phonology (PfP; Nasukawa 2014; Nasukawa and Backley 2015). 
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GP 2.0 


GP 2.0 posits that there is a strong and inherent connection between certain 
‘anomalous’ elements and their prosodic behaviour. Namely, |A, H, ?| 
compared to |U, I, L| show both a more restricted distribution (for example, |?| 
is argued to never plausibly occupy nuclei) and are frequently linked to length 
phenomena (for example, [V:sC] sequences in British English must have |A| in 
either V or C). The fundamental proposition of GP 2.0 is that |A, H, ?| are not 
melodic primes, but the result of pure structure. Compare the two approaches 
to the Pulaar continuant-stop alternation under pluralisation as in [lefol] 
‘pennant’ — [lep:i] ‘pennants’ in (1), where GP 2.0 is able to capture the 
three factors of suffixation, fortition, and gemination all as a single structural 
alternation, namely an additional projection of the head xO. 


(1) a. Lateral (~Standard GP) b. Embedded (GP 2.0) 
ONONON ONONON O' oO" 
A JT T ax 
x X XX 1 i i ; a xı xOy x, O' 

Ll || 
leUol l&u ? i X€xO iy; 
[f] [p:] [f] [p:] 


Precedence-free Phonology 


Not dissimilar in seeking a more unified explanation of prosody and melody, 
PfP also makes use of recursive embedding but proposes that by retaining the 
melodic primes as the primary combinatory unit and combining these more 
freely into larger embedded structures we can dispense with certain lateral 
relations and prosodic categories. This approach is illustrated in Chapter 22, 
where Backley and Nasukawa set out an analysis of French nasal vowels and 
Japanese moraic nasals without reliance on feet, syllables, or segments as 
distinct phonological categories. 


Radical CV Phonology 


RCVP again pursues many similar aims to GP2.0 and PfP by the means of 
structure. However, both the stated aim and assumptions of the approach 
differ somewhat. First, RCVP is much more strongly rooted in Dependency 
Phonology (Anderson and Ewen 1987) than other current work. Second, it 
pursues a radical reductionism whereby only two feature primitives (the units 
C and V) are assumed. Recursively combining these two units in specific 
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dependency configurations, RCVP posits that sufficient structural restrictive- 
ness then permits the relevant categories to be read off these configurations. 
While early on the central focus of this approach was on melodic categories, 
van der Hulst has proposed many further developments of the theory which 
capture much prosodic structure by further expanding the structures of C and 
V units. Especially when expanded to complex prosodic structure, as in van 
der Hulst’s Chapter 23, RCVP makes the case not only for the maximisation of 
structure in order to reduce melodic or prosodic complexity, but for extensive 
recursivity inside phonology as a powerful explanatory device. 
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Nasal vowels in French: a 
precedence-free approach 


Phillip Backley and Kuniya Nasukawa 


Introduction 


It is estimated that about one fifth of the world’s languages use nasality 
contrastively in vowels.! In French, Portuguese, Yoruba, and Punjabi, for 
instance, oral and nasal vowels are lexically distinct, for example, Punjabi / 
lù:/ ‘hair’, /lu:/ ‘heat wave’ (Bhatia 1993, 337). By contrast, languages which 
do not belong in this group — English, Italian, Hawaiian, Persian, and countless 
others — have no such contrast. Instead, they tend to display contextual nasalisa- 
tion, where nasality is systematically realised on vowels in the context of nasal 
consonants, as in English calm /ka:m/ [kd:m]. 

The focus here is not on context-dependent nasalisation but on 
contrastive nasal vowels, and specifically, those in French. French nasal vowels 
(henceforth FNVs) have been treated extensively in the literature and the 
data are already familiar to many. In this chapter, however, we take the novel 
approach of analysing FNVs using a version of the Element Theory model 
called Precedence-free Phonology or PfP, in which melodic structure is built 
around (vertical) head-dependency relations rather than (horizontal) sequences 
of segments. We begin by describing the history of FNVs and their distribution 
in modern French. Two analyses of FNVs are then presented, firstly, one using 
a standard derivational approach and, in the next section, another using a PfP 
approach. Finally, we draw parallels between our PfP analysis of FNVs and a 
PfP account of the Japanese mora nasal. 
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FNVs and their historical development 


Regarding the status of FNVs,? a point of contention concerns whether they 
should be treated as lexical or derived (Tranel 1981, 3ff.). The lexical view 
taken in Martinet (1971, 143) and Trubetzkoy (1967, 130ff.) treats FNVs as 
distinct phonemes with their own nasality feature, allowing them to contrast 
with oral vowels. On the other hand, the derived view taken in Dell (1970) 
and Schane (1968) sees FNVs not as single segments but as sequences of an 
oral vowel plus a nasal consonant. In a PfP approach, the question of whether 
FNVs are derived or not is irrelevant because PfP is a non-derivational 
model. 

The historical development of FNVs is thought to have gone through the 
stages in (1).° Note that, although scholars disagree over precisely when these 
changes took place, this issue is not important to the present discussion, which 
is concerned with how to represent the melodic changes themselves. 


(1) stage 1 stage 2 stage 3 
(Old French) (Middle French) (Modern French) 
a. lande‘moor’ [lando] [lado] [lad] 
b. bon ‘good’ [bôn] [b3] [bô] 
c. dame ‘lady’ [dămə] [damə] [dam] 


In stage 1 there was no contrast between oral and nasal vowels. Presumably, 
however, vowel nasalisation took place as a coarticulatory effect, as in 
[landə]>[lãndə] ‘moor’. Then in stage 2 a nasal consonant deleted after 
a nasalised vowel, as in [bõn]>[bõ] ‘good’. Also, nasalised vowels were 
denasalised before intervocalic /n, m/ and these nasal consonants were 
retained, giving the pattern [dămə]>[damə] ‘lady’. Finally, stage 3 saw 
the loss of word-final /ə/, leading to modern forms such as lande [lad] and 
dame [dam]. The two patterns of FNV distribution in present-day French are 
illustrated in (2). 


(2) a. Non-alternating (nasal) 


sombre [sobs] ‘dark’ 
lande [lad] ‘moor’ 
cinquante |sčkăt] ‘fifty’ 


b. Alternating (nasal ~ oral) 
bon [bò] ~ bonne [bon] ‘good’ (masc. ~ fem.) 
paysan [peiza] ~ paysanne [peizan] ‘peasant’ (masc. ~ fem.) 
maison [mez] ~ maisonnette [mezonet] ‘house’ ~ ‘little house’ 
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The difference between the two patterns corresponds to a difference in the way 
the historical nasal consonant was syllabified in Old French. In the (2a) pattern, 
the nasal (which has since elided) was followed by an obstruent, which places it 
in arhymal complement position, as in (3a). By contrast, in the (2b) pattern the 
original nasal was morpheme-final (in uninflected words). Using the syllabi- 
fication system set out in Charette (1991) and later employed in Harris and 
Gussmann (1998), this places it in the onset of a final ‘dull’ syllable, as in (3b). 


(3) a. sombre /sən.bgə/ ‘dark’ b. bon /bs.n/ ‘good’ 


First, consider the words in (4). Like sombre [sõbg], they have a non- 
alternating nasal vowel. Historically, they contained a nasal consonant in a 
syllable ‘coda’ (rhymal complement). 


(4) Non-alternating (nasal) pattern 
lande [lad] ‘moor’ 
donc [dõk] ‘therefore’ 
ensemble [àsăbl] ‘together’ 
singe [sèz] ‘monkey’ 


There is nothing unusual about the pronunciation of these words in modern 
French, since they have been shaped by the kinds of phonetic effects and/or 
phonological changes that show up in many languages. For phonetic reasons 
(that is, coarticulation), the vowels in (4) are produced with nasal resonance 
because they precede historical nasal consonants. And for phonological 
reasons (that is, preservation of melodic/segmental properties), these historical 
nasals were free to elide because nasality appears on the neighbouring vowel. 
The nasal place property is also preserved, since the nasal was homorganic 
with the following obstruent, for example, [n] in Zande, [n] in donc, [m] in 
ensemble. Notice that the loss of this coda consonant leads to a less marked 
syllable structure — a frequent driver of diachronic change cross-linguistically. 

Next, consider the words in (5). Like bon [bò], their vowels alternate 
between nasal and oral. Historically, as uninflected words they had a final 
nasal consonant syllabified as in (3b). 


ELEMENTS, GOVERNMENT, AND LICENSING 


(5) Alternating (nasal/oral) pattern 


a. Nasal vowel b. Oral vowel 
brun [bse] ‘brown (masc.)’ brune [bsyn] ‘brown (fem.)’ 
chien [fjé] ‘dog (male)’ chienne [fjen] ‘dog (female)’ 
tient [tj@]  ‘(he) holds’ tiennent [tjen]  ‘(they) hold’ 


tiens [tjé]  ‘(I) hold (indic.)’ tienne [tjen] ‘“(T) hold (subjunc.)’ 


Again, there is nothing unusual about the modern pronunciation of the (5a) 
words. As in (4), they display vowel nasalisation as a coarticulatory effect, 
allowing the final nasal consonant to elide without losing nasality altogether. 
At first glance, the loss of this final consonant appears to result in a loss of 
contrastive information, since its place specification is not shared by neigh- 
bouring sounds, unlike in (4). On this point, however, Azra (2000) notes that 
in words such as bon, chien, etc., the final nasal is invariably dental, so the 
nasal’s place property is already non-distinctive. On this basis, nasal consonant 
elision in (4) and in (5a) may be treated in parallel, with both changes resulting 
in a less marked syllable structure — the loss of a coda consonant in (4) and the 
loss of a domain-final consonant in (5a). 


Representing FNVs (derivational approach) 


First, consider the difference between the non-alternating vowels in (4) and the 
alternating vowels in (5). As (3) shows, they differ in syllable structure. And 
we suggest that this accounts for their distinct phonological behaviour. The 
vowels in (4) are domain-internal, so their phonological and morphological 
environments never alter. This makes them impervious to external influences, 
and consequently they do not alternate. By contrast, the vowels in (5a) are 
domain-final, so their environment can change as a result of suffixation. Here 
the focus is on cases where suffixation causes an oral vowel to appear in the 
stem (following the (5b) pattern) rather than a nasal vowel (following the (5a) 
pattern). 

Modern French distinguishes between nasal vowels (V) and sequences 
of an oral vowel plus a nasal consonant (VN). In nouns and adjectives, they 
mark different genders: masculine forms end in V (chien, brun) and feminine 
forms in VN (chienne, brune). In some verbs V and VN distinguish singular 
versus plural: singular 3 forms (il tient [tj] ‘he holds’) end in V while plural 
3 forms (ils tiennent [tjen] ‘they hold’) end in VN. And V versus VN can also 
separate indicative mood (je tiens [tjè] ‘I hold’) from subjunctive mood (je 
tienne [tjen] ‘I may hold’) in some present tense verb forms. 

The difference is typically explained by referring to a ‘schwa morpheme’ 
which, as a suffix, triggers phonological effects that encode grammatical 
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features of the kind just described. However, not all scholars agree on how 
schwa should be represented in French. We choose to reject earlier treatments 
of schwa (for example, Schane 1968) which rely on a degree of abstractness 
to capture vowel alternations in purely melodic (segmental) terms. Instead, we 
focus on the prosodic properties of schwa. Our approach follows Anderson 
(1982), which treats French schwa as a melodically unspecified (empty) 
nucleus. That is, we see schwa as a prosodic entity rather than a segmental 
one. 

To account for the nasal/oral alternations in (5), we assume that 
unspecified schwa can exist as an independent morpheme. To show how 
this schwa morpheme is used, we turn to the forms in (5b) and outline their 
morphological structure in (6b). Note that schwa is transcribed here as /ə/, 
even though it may be phonetically silent in French or may be realised with a 
[ce] quality.* 


(6) a. brun /bsyn/ [bs&] ‘brown (masc.)’ 
chien /fjen/ [fe] ‘dog (male)’ 
tient /tjen/ [tjé] ‘(he) holds’ 
tiens /tjen/ [tjé] ‘(T) hold (indic.)’ 


b. brune /bgyn/ +/ə/ [bsyn] ‘brown (fem.)’ 
chienne /fjen/+/3/ [fen] ‘dog (female)’ 
tiennent /tjen/+/ə/ [tjen] “(they) hold’ 
tienne /tjen/+/ə/ [tjen] ‘“(T) hold (subjunc.)’ 


The words in (6a) have nasal vowels, and they illustrate the rule of consonant 
deletion in French, which removes word-final consonants. It applies unless 
the consonant is followed by a vowel-initial word within the same phrase. So, 
final /n/ in brun, chien, tient, and tiens deletes when the word is pronounced 
in isolation. Again, the nasality associated with final /n/ is preserved by being 
phonetically realised on the stem vowel, producing a nasal vowel. That is, 
the lexical sequence VN is interpreted as V. Expressed in derivational terms, 
vowel nasalisation is shown in (7). Note that the deletion rule in (7b) actually 
targets prosodic structure, not segmental structure, since it deletes the onset of 
the word-final syllable. 
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(7) a. brun mF (lexical) b. brun y (realisation) 


r y a. 

| i = | | 

N N N N 
| ae 
b x [U| |L?]| b x [HMU] E DD] 
To be phonetically realised, the nasality represented by |L| must associate to 
another position. In this case it attaches to the preceding nucleus to create a 
nasal vowel. While the nasality element |L| reassociates to the nucleus, the 
occlusion element |?| also present in /n/ cannot reassign because in French (and 
most other languages) |?| does not appear in a syllable nucleus. 

Now consider the words in (6b), which contain the schwa morpheme 
and are realised with an oral vowel. Their final schwa is silent because this 
vowel is nothing more than an empty nucleus (which, in French, may be left 
uninterpreted — see section below, ‘PfP structures in French’). Yet although 
no vowel sound is pronounced, the prosodic structure of the schwa morpheme 
remains intact. And this structure protects the final /n/ of brune from deletion 
since it is not in the final syllable of the word. The stem /bgyn/ is therefore 


unchanged: the oral vowel remains oral and the final /n/ avoids deletion, as 
(8) shows. 


(8) a. brune /bsyn +ə/ b. brune [bsyn] 
o o o o 
“1 Al pd eal | 
oO R O RR O R R R 
| | | > | || 
N N N N N | 
b x e IL 21 | b x bs IL 2 II (0) 


Although these analyses depart from some traditional accounts of FNVs (for 
example, Schane 1968), they are nevertheless ‘conventional’ by virtue of their 
derivational approach. In the next section we contrast this with an alternative 
analysis based on the representational approach PfP. 
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Representing FNVs (PfP approach) 


Representations in PfP 


In PfP, phonological representations do not refer to precedence relations 
(that is, linear ordering) between structural units, the aim being to eliminate 
redundancy (Anderson 1987; Backley 2021; Backley and Nasukawa 2020; 
Nasukawa 2011, 2014, 2015, 2016, 2017ab, 2020; Nasukawa and Backley 
2015, 2017).° And because PfP does not treat precedence as a formal property, 
representations do not include timing units such as skeletal positions, CV 
units or Root nodes. From this it also follows that the unit ‘segment’ has no 
formal status. Instead, segments and the precedence relations between them 
are viewed as the phonetic outcome of externalisation. 

In PfP, elements are the only structural units permitted in phonological 
representations. Unlike in other element-based approaches, however, elements 
in PfP have a dual function: they represent melodic properties in the usual 
way and they also project onto higher structural levels where they behave as 
organising units. That is, they function as prosodic constituents, taking the 
place of traditional prosodic units such as ‘nucleus’, ‘mora’, ‘syllable’, and 
‘foot’. 

In PfP, the unit corresponding to a nucleus is represented by a single 
element from the set of resonance elements {|Al, |I|, |U|}. The choice is 
parametric, the chosen element reflecting the phonetic quality of a language’s 
baseline resonance. For example, English uses |A| (realised as [9] in its acous- 
tically weak form), Fijian chooses |I| (realised as weak [#]), and Japanese 
selects |U| (realised as [w]) (Nasukawa 2014). We propose that French is 
like English in having |A| as its base element. By itself, the base element 
is pronounced as a weak or non-contrastive vowel. By contrast, full vowels 
have more complex structures in which the base element takes one or more 
elements as dependents. Example structures for full vowels are given in 
(9bcd), all with |A] as their base element. Note that the base element functions 
as the head of each expression. 


(9) Vowel representations in PfP (Nasukawa 2020, 15) 


a [oa] b [i] c. [ul d. [a] 
IA] |A| IA] |A| 
Ss AS [S 
[A] II IA] lU] IA] JAI 


(9a) is a minimal structure containing only the base element |A]. Recall that 
the base element functions as a prosodic unit equivalent to a nuclear position, 
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so (9a) has the properties of an empty nucleus, that is, a nuclear position with 
no dependent (melodic) structure. Predictably, it behaves as a default or non- 
contrastive vowel which may serve as a reduced vowel or as an epenthetic 
vowel in loanwords. This vowel is usually transcribed as /a/, but its non- 
contrastive status means that its phonetic quality can vary, for example, [9] in 
English, [9]~[ce] in French. 

By contrast, (9b—-d) have complex structures in which the base element 
has dependents. Dependent elements are incorporated into the structure using 
a syntax-like Merge operation (Nasukawa 201 7a). The elements |I|, |U|, and |A| 
are present as dependents in (9b), (9c), and (9d) respectively, and in each case 
the acoustic pattern of the dependent element overrides the baseline resonance 
from the base element |A| to give the full vowels [i], [u], and [a]. What this 
shows is that the phonetic realisation of a head-dependent structure is largely 
determined by the relative prominence of its dependent element(s). This is 
formalised as in (10). 


(10) The Principle of Phonetic Realisation of Head-Dependency 
Structure 
(Nasukawa 2014, 2015, 2016, 2017a,b; Nasukawa and Backley 2015, 
2017, 2018) 


Dependents, which are not necessary for structural well-formedness, 
are phonetically more salient in terms of their modulated carrier signal 
than heads, which are important for building structure. 


In the mapping between phonology and phonetics, the prominence of 
dependents is attributed to the relatively larger modulations of the carrier 
signal with which these elements are realised phonetically.’ This applies not 
only in structures with two elements but also in more complex structures with 
further levels of embedding. 


PfP structures in French 


Using the PfP architecture, (11) gives representations of the melodic units that 
are relevant to the analysis of FNVs. 
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(11) a. [n] b. [nə] 


2 A| 
AA A } [2] 


I |? [217 IAI 
71| 
IL] M I |?) f [n] 
AA 
IL [Il 


As already mentioned, we assume that the base element is |A|. This is 
because the default vowel in French can be pronounced [a], which is the 
baseline resonance produced by realising the base element |A| on its own, 
that is, without dependent structure (see 9a). By contrast, [n] in (11a) has the 
relatively complex structure |? I L|. These elements combine via the Merge 
operation mentioned earlier, creating a hierarchical structure in which each 
element occupies a different level of embedding. Comparing (9a) and (1 1a) 
reveals that the basic split between consonants and vowels is expressed in 
PfP by a difference in their base element: vowels take their base element 
from the resonance set {|A]|, |I|, |U|} while consonants choose from the non- 
resonance set {|H], |?|}. In the case of nasal consonants, the base element is |?| 
(Nasukawa and Backley 2018), so the expression in (11a) is a nasal consonant 
rather than a nasal vowel. In (1 1a) the base element |?| has two dependents: |L] 
adds nasal resonance® and |I| provides the dental place specification for [n]. 
By merging the vowel expression [9] in (9a) with the consonant 
expression [n] in (11a) we get the syllable-sized unit in (11b). The Merge 
operation creates a head-dependent relation between the vowel domain and 
the consonant domain, and this relation then determines how the overall 
structure is externalised. Externalisation is formalised by a Principle of 
Precedence (Nasukawa et al. 2019) which states that lower (that is, more 
deeply embedded) melodic structure is phonetically realised before higher 
melodic structure. Therefore, the (dependent) C-domain precedes the (head) 
V-domain when a syllable-sized unit such as (11b) is pronounced, giving the 
default CV pattern rather than VC.’ It is not possible to realise the vowel and 
consonant expressions in (1 1b) simultaneously, because each requires different 
articulatory gestures. So, a staggered phonetic realisation is necessary. 


FNVs (nasal vowel alternant) 


In principle, (11b) can be realised as the CV sequence [nə]. But when this 
structure is word-final in French, only [n] is pronounced. This is because 
French belongs to the set of languages allowing words to end phonetically 
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in a consonant. In Government Phonology this behaviour is captured by the 
Domain-Final Empty Nucleus Parameter (Kaye et al. (KLV) 1990; Charette 
1991; Harris 1994), which controls whether a word-final or morpheme-final 
empty nucleus is pronounced or not. The same pattern is expressed in PfP 
by the parameter in (12) which, to be consistent with the aims of PfP, avoids 
referring to linear ordering between melodic units. 


(12) The Ultimate Head Parameter (UHP)!° 


The highest base element in a morphological domain is p-licensed!'! if 
it has no vocalic dependents. [OFF/ON] 


So, because the UHP is set to ON in French,!? the vowel of (11b) is silent 
when it appears word-finally (see (5b) above). In this respect, French differs 
from languages such as Zulu and Japanese, in which words must end phoneti- 
cally in a vowel because the UHP is set to OFF. In these systems, the highest 
base element must be audible even when it has no vocalic dependents, in 
which case it is pronounced as a default vowel. For instance, the base element 
in Japanese is |U|, which is phonetically realised as [w], for example, geemu 
[gemu] ‘video game’. 

Consider now the words in (6a), repeated here as (13). Each has a nasal 
vowel which alternates with an oral vowel (when the schwa morpheme is 
added, as in (6b)). We assume that, for each word, the stem is phonologically 
identical in both alternants, and that this stem has a final consonant. Here, 
phonological forms are shown within slashes. 


(13) brun  /bsyn/ [bce] ‘brown (masc.)’ 
chien /fjen/ [HE] “dog (male)’ 
tient /tjen/ [tj] ‘(he) holds’ 
tiens /tjen/ [tj] ‘(I)bold (indic.)’ 


From the preceding discussion it follows that forms such as /bgyn/ have 
structures in which the highest base element is p-licensed by the UHP, so its 
vocalic portion is not phonetically realised. (In traditional parlance, the final 
consonant precedes a silent ‘nucleus’.) This is shown in (14a). 


NASAL VOWELS IN FRENCH: A PRECEDENCE-FREE APPROACH 


247 


248 


(14) a. /beyn/ b. [bse] 


IAI" (=F) NG 
ao OSC Eo mo 
PO ALEN) 
mal 
mil 
lb wy n/ [b x &] 


Also, recall from the section ‘Representing FNVs (derivational approach)’ that 
French has a consonant deletion rule which ‘deletes the onset position in the 
final syllable of a word’. In PfP terms, this involves removing the consonant 
expression that is a dependent of the highest base element, as in (14b). When 
structure is deleted in this way, we expect all of its associated elements to be 
lost. But in the case of nasal consonants, |L| (nasality) remains and is realised 
by merging with the stem vowel, as indicated by the arrow in (14b). 

A perceptual reason can be offered for why |L| survives, given that 
nasality is a defining property of the original /n/, and by extension, of the entire 
stem. If nasality were lost entirely, this would impact the perceptibility of the 
masculine form brun. Moreover, the importance of nasality to the identity of 
/n/ is reflected in phonological structure. In PfP, elements combine by forming 
chains of head-dependent relations, resulting in hierarchical structures of the 
kind in (14). At the top of each vertical structure is a base element, which has 
a prosodic role, that is, important for structure-building but not for expressing 
contrasts (since its acoustic properties are overridden by those of dependent 
elements). On the other hand, elements positioned lower in the hierarchy show 
the opposite characteristics, that is, unimportant for structure (for example, 
they do not support dependents) but important in melodic terms because they 
represent salient acoustic properties (Nasukawa 2017b; Backley 2021). In 
other words, the more deeply embedded an element is, the greater its acoustic 
prominence and its linguistic significance will be. It is no coincidence that |L| 
is the terminal element in the structure for /n/ in (14): as the lowest element in 
the hierarchy it is a defining property of nasal consonants. 


FNVs (oral vowel alternant) 


Consider now the words in (6b), repeated here as (15). They have the same 
stems as the words in (14) but are suffixed with the schwa morpheme 
to express feminine gender in brune and chienne, plural in tiennent, and 
subjunctive mood in tienne. 
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(15) brune /bsyn/ + /9/ [bsyn] “brown (fem.)’ 
chienne /fjen/+/o/ [fjen] ‘dog (female)’ 
tiennent /tjen/+/ə/ [tjen] ‘“(they) hold’ 
tienne /tjen/+/ə/ [tjen] ‘(D hold (subjunctive)? 


These suffixed forms have the oral vowel alternant shown in (5b), so a phono- 
logical analysis must explain why a VN sequence (cf. V) is pronounced in the 
context of the schwa morpheme. We propose that VN in [bgyn], [tjen], etc., 
results from the way morphological concatenation operates between stem and 
suffix. Following Nasukawa (2020), we claim that when a suffix is added it 
becomes the head of the new morphological domain. So the schwa morpheme 
in (16) occupies the highest level in the concatenated structure. Note that the 
schwa morpheme itself is melodically empty and phonetically silent: it is 
empty because it has no lexical melodic properties (/a/ corresponds to a bare 
base element |A]), and it is silent by virtue of the UHP (being the highest base 
element in the morphological domain, |A| is p-licensed). 


(16) /bsyn + 9/ [bsyn] 


7 IA] } <— schwa morpheme (silent) 
es | m final stem vowel (silent) 
i Ar 
A (Al) 
H PI < final stem consonant [n] 
ILI M 


[bx yn] 


In (16) there are two vocalic base elements (circled) with no dependent vowel 
elements. And because both are highest in their respective morphological 
domains (that is, stem, suffix), both are silent following the UHP. (Informally, 
each one is a final empty nucleus.) So, when the suffixed form is pronounced 
it ends phonetically in a consonant [n]. Unlike in (14b), however, [n] is not 
targeted by the consonant deletion rule as it is no longer ‘word-final’, that is, 
not a dependent of the highest base element in the word. This is because, after 
suffixation, the highest base element is that of the schwa morpheme. Nasality 
is therefore realised on the consonant, and there is no need for it to migrate to 
the stem vowel. The result is an oral vowel in (16) which alternates with the 
nasal vowel in (14b). 
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The mora nasal in Japanese 


Finally, we observe a parallel between the French nasal vowel in (14b) and 
the Japanese mora (or syllabic) nasal. Although the literature on Japanese 
phonology has not made much of the idea that Japanese could have nasal 
vowels,!? it has been noted in the element-based literature that similarities 
do exist between nasal vowels and syllabic nasals (Yoshida S. 1991, Yoshida 
Y. 1995, Youngberg 2021). From a PfP perspective, the two have much in 
common both structurally and phonetically — so much so, that we propose to 
treat them as equivalents. Following Nasukawa (2004) and Nasukawa and 
Backley (2017), the Japanese mora nasal is represented as in (17). In hon [hd] 
‘book’, the vowel and the mora nasal w merge phonetically to produce the 
nasal vowel [6] (Yoshida 1996). This echoes the merging effect observed in 
French, as shown in (14b). 


(17) hon [hd] ‘book’ 
JUJ” 


ee | 


|U lU} 

a] 
ul 

a 


| 
IL 


In the Japanese case, the mora nasal w is typically described as ‘placeless’ 
because it has no lexical place specification. Accordingly, it is represented 
as a simplex structure containing only |L| for nasality. Yet this renders it 
highly unstable as a consonant — in general, nasals seem to require a place 
specification in order to be realised as consonants. So, as an avoidance 
measure |L| merges with the vowel o and the two are realised simultaneously 
as [0]. 

This analysis of the mora nasal allows a simplified explanation of certain 
phonological patterns in Japanese. One concerns a dialect difference between 
standard (Tokyo) Japanese and two regional dialects, Oogami and Kagoshima. 
Nouns which end in the sequence nw (for example, inu ‘dog’) in Tokyo 
Japanese are realised with a mora nasal [n] in the Oogami and Kagoshima 
dialects (for example, in ‘dog’). This is usually explained by saying that the 
final vowel w is lost and the dental nasal n changes into a placeless mora nasal 
N. In PfP, however, it can be captured by referring to the same deletion rule 


ELEMENTS, GOVERNMENT, AND LICENSING 


that was proposed for French and illustrated in (14b), where the place element 
|I| and the occlusion element |?| are suppressed. (18a) shows the structure for 
Tokyo Japanese, while (18b) represents the non-standard form. 


(18) a. inu /inu/ [inu] b. in /in/ [itt] 
y y 
|U] |U] |U] Ju] 
— | 
I U] [| |U ql [U] [U] 
=] 
i [2] 
IL| H IL] 


[i n w] [i tw | 


What happened in the Oogami and Kagoshima dialects, shown in (18b), is 
similar to the historical process which led to the emergence of FNVs, shown 
in (14b). In Japanese, consonant deletion took place but again |L| survived, 
perhaps for the same perceptual reasons that were mentioned above. And 
importantly, the remaining structure comprising the base element |U| and 
nasal |L| is almost identical to the structure of the mora nasal in (17).'4 In 
Japanese, this structure can be interpreted as it is — unlike in French, there 
is no need for |L| to merge with a neighbouring vowel. The resulting sound 
is [tt], a nasalised reflex of the default vowel [w]. Although this sound is 
conventionally described as a mora nasal, we redefine it here as a nasal vowel 
[ti]. After all, it has the phonological structure of a nasal vowel as given in 
(18b), and when pronounced it has the phonetic characteristics of a nasal 
vowel. 


Summary 


We have analysed familiar data in an unfamiliar context, the representa- 
tional approach called PfP. In some respects, PfP may seem unorthodox, 
yet it reveals how a small set of elements, together with a standard notion 
of head-dependency, has the potential to integrate melody and prosody 
into a single representation and offer new insights into the nature of 
the phonological grammar. We look forward to seeing further develop- 
ments in the way elements can enrich our understanding of phonological 
patterning. 


NASAL VOWELS IN FRENCH: A PRECEDENCE-FREE APPROACH 


251 


252 


Notes 


Wii 


12, 


13; 
14. 


This work was supported by the following MEXT/JSPS KAKENHI grants: Grant-in-Aid for 
Scientific Research on Innovative Areas #4903 (Evolinguistics) Grant Number JP20H05007, 
Grant-in-Aid for Scientific Research (S) Grant Number JP19H05589 and Grant-in-Aid for Scientific 
Research (A) Grant Number JP19H00532 and Grant-in-Aid for Scientific Research (C) Grant 
Number JP22K00513. In writing this chapter, we hope to express something of the considerable 
intellectual debt we owe to Monik Charette. She was the first person to show us (both authors) 
that phonology could be different from the way it was presented in standard textbooks. And Monik 
managed to do this by rousing our curiosity, rather than by simply teaching us. The fact that we are 
still curious about phonology today says a great deal. 

The data presented here come from standard (that is, Parisian or Continental) French. For a discussion 
of the patterns in other varieties of French, see Charette (2003) and references therein. 

For discussion, see for example Ruhlen (1973), Hajek (1997), and Azra (2000). 

French also has a contrastive vowel /œ/, with which the schwa vowel overlaps phonetically. However, 
the two vowels differ in phonological behaviour (Anderson, 1982, 538), suggesting that they have 
distinct representations. 

Cross-linguistically, nasalisation occurs more readily on non-high vowels than on high vowels (Hajek 
1997, 116ff.). Reflecting this, the set of FNVs excludes the high vowels */i/, */¥/, and */ŭ/. This 
tendency also relates to the observation that nasalisation can have a lowering effect on target vowels, 
as appears to happen in (7b). This is not shown in the melodic representation of (7b), as lowering is 
not relevant to the present discussion. 

In another approach that also aims to eliminate representational redundancy, Raimy (2000) and 
Samuels (2009) represent phonological properties without referring to asymmetric relations (for 
example, dependency) between units. 

Harris (2009) argues that the carrier signal makes it possible for linguistic messages to be heard 
whereas the energy in a modulated carrier signal contains linguistically contrastive properties that 
enable listeners to recognise morphemes/words. Furthermore, Ohala (1992), Ohala and Kawasaki- 
Fukumori (1997), Traunmiiller (1994; 2005), and Harris (2006; 2009) claim that the carrier signal in 
speech manifests itself as a schwa-like quality characterised by an absence of converging formants 
in its periodic signature. According to Harris (2006; 2009), the size of the modulations of this carrier 
signal are then defined in terms of acoustic attributes such as periodicity, amplitude, spectral shape, 
fundamental frequency, and duration/timing. 
In PfP the |L| element represents two categories of sounds: when it cooccurs with |H] it is interpreted 
as obstruent voicing, but when no |H| is present the |L| element contributes nasality (Nasukawa and 
Backley 2018). 
To allow for the existence of a small number of languages (for example, Kaqchikel) where the 
preferred syllable type is VC rather than CV, it has been proposed that the Principle of Precedence is 
parametric (Nasukawa et al. 2019). 

This is an adaptation of the Ultimate-head Parameter introduced in Onuma (2015, 136). 

Expressed in PfP terms, ‘p-licensed’ describes an empty vocalic domain which is not phonetically 
realised. (An empty vocalic domain has a base (head) element but no melodic (dependent) 
elements.) 

Evidently, French changed its parametric setting for the BHP from OFF to ON before the start of the 
Modern French period. This is clear from Middle French forms such as Jande [lado] ‘moor’ in (1a), 
in which a default [9] (that is, baseline resonance) is pronounced. 

The exception here is Nakano (1969). We are grateful to one of the editors for pointing this out. 
Although (17) has an additional layer of structure containing |?l, this does not affect phonetic 
interpretation: as the base element of the consonantal domain, |?| functions as a prosodic unit (that is, 
an onset constituent) rather than as a melodic unit. 
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Recursive syllable structure in RCVP 
Harry van der Hulst 


Introduction 


In van der Hulst (2010) I suggested that a case can be made for recursion in 
syllable structure, a proposal that develops a suggestion in Smith (1999; 2003) 
and other work. This idea runs counter to the usual statement that there are no 
‘syllables inside syllables’, or that phonology at large is not recursive, except 
where it copies recursive syntactic structure. In van der Hulst (2010; 2020) 
I focus on recursion in the rhyme of the syllable, which allows embedded 
syllables, to capture trochaic and dactylic feet. In this chapter, I explore 
recursion in the onset, which will be justified with reference to ‘excessive’ 
consonant clusters that exceed the option of maximally having an obstruent 
followed by a sonorant consonant. I show that a dependency approach does 
not require the adoption of empty nuclei to accommodate excessive clusters, 
while it does explain the maximal complexity of such clusters. 


Syllable structure in RCVP 


Four core positions 


I will briefly discuss the way in which RCVP represents ‘core’ syllable 
structure (based on van der Hulst 2020; see van der Hulst 2021 for a synopsis). 
Faithful to the basic premise of RCVP, the syllable itself is a combination of 
the C and V units, which, if no further splitting applies, delivers the ‘strict? CV 
syllable structure that all languages have. If languages exceed this minimal 
CV syllable, this results from splitting the C and/or V unit, which produces 
binary branching onsets and rhymes, respectively: 
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(1) a. syllabic positions 


b. C CGV V:C V 


While the four-way division as such implies no linearisation (see the 
section ‘Linear order in the lexicon is limited’), when combined into a 
syllable structure, linear sequencing will be dictated by some version of the 
well-known ‘Sonority Sequencing Generalisation’, according to which the 
sonority level rises towards the nucleus and then descends: 


2) a. syllabic positions, linearised 
C v 
—. ees 
C v v C 
b r I m 
n we 
b r I m 


A proper dependency representation of a syllable structure that contains all 
four syllabic categories is as follows (adding convenient unit labels for each 
construction and for each of the four segmental positions, although the labels 
have no formal status: they are just used for convenience when I refer to 


syllabic units): 

(3) a. V SYLLABLE 
C ONSET V RHYME 
b i i mh 


b. EDGE BRIDGE NUCLEUS CODA 


& C C; V V V;C 


I assume a strict dependency model, which means that there is no ‘constituent 
structure’.! 
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In ‘classical’ Government Phonology (GP) (Kaye et al. (KLV) 1990), 
it was assumed that a syllable with four positions represents the universally 
maximal expansion of a syllable that can occur freely (that is, word- 
internally, as well as at the word beginning and end, although the edges 
may allow ‘extra segments’). This would once more seem to indicate that 
there is, in fact, a maximal two-way split that universally limits the degree 
of complexity within each domain, whether a segment-internal element class 
or a structural unit like the syllable. Whereas element groups represent para- 
digmatic dimensions and the syllables represent the syntagmatic dimension, 
the structural possibilities are the same, indeed structurally analogous. It is 
significant that both dimensions, whether paradigmatic or syntagmatic, are 
subject to the same system of categorisation as captured by the C/V syntax 
of RCVP. 

The four syllabic positions are very restricted in terms of which segment 
types can occupy them. In fact, the C/V encoding of the syllabic positions 
defines the major classes of the segments that they allow. The onset head (C) 
only allows obstruents, while the rhyme head (V) only allows vowels. The 
two dependent positions (C;V, V;C) are reserved for sonorant consonants. 
It is interesting that the C/V structure for the onset dependent is C;V, while 
the structure for the coda is V;C. This suggests that the onset dependent is 
‘stronger’, that is, more C-like than the coda position, which comes out as 
weaker (and more ‘sonorant’). That sonorant consonants are phonetically 
stronger in the onset than the rhyme is empirically shown by the difference, 
for example in English, between liquids in the onset and in the coda, where the 
latter have a much weaker constriction. The difference between the bridge and 
the coda position, apart from having an effect on the phonetic implementation 
of sonorant consonants, also provides a basis for the fact the latter often only 
allow a subset of sonorant consonants, such as the liquid. That said, the coda 
position can also be limited to a subset, such as only nasals; see van der Hulst 
(2020, §4.3.3).4 


Sonorant onsets and syllabic sonorants 


Let us now turn to the obvious point that it is not the case that only 
obstruents can appear in the edge position. Sonorant consonants can also 
form onsets by themselves and in that case, they occupy the edge position. 
Likewise, we must be able to represent syllabic consonants, which 
means that the nucleus must be able to contain consonantal segments. 
To address these necessities, in (4a) and (4b) I propose two different 
structures for sonorant consonants when they occur as onset heads and as 
onset dependents: 
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T V 
V sonorant consonant as onset head 
C|V 
b. V 
eae | 
C V 


sonorant consonants as onset dependent 


C; V 


In both cases V is dependent on C and yet the syllabic position will reflect the 
difference between V being subjoined or adjoined to C. In (4a) I represent the 
former structure notationally as C|V, indicating that V is subjoined to C, rather 
than being adjoined as in (4b). The essential difference between X|Y and X;Y 
is that the latter leads to a linear order of both nodes, while the former does 
not. 

Likewise, there are two different structures for syllabic sonorant 
consonants and coda sonorant consonants: 


(5) a. | 
dias V 


C sonorant consonant as rhyme head, 
that is, ‘syllabic sonorant consonant’ 
VIC 


C sonorant consonant as onset dependent 


V;C 


The distinction between subjunction and adjunction has already been used 
in the representation of the syllable as a whole (see 3), in which the V 
node is subjoined to itself so that the onset and the coda can be dependents 
at two different levels. I refer to Böhm (2018) for a discussion of the 
distinction between adjunction and subjunction and the legitimacy of both in 
a dependency approach.° 
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We have to recognise that some languages may impose restrictions that 
narrow down even this restricted set of options for each syllabic position. 
Indeed, there are languages that disallow high-sonority consonants (specifi- 
cally liquids and glides) in the onset head position; see Gordon (2016), Smith 
(2003), and Flack (2007). If, for example, only nasals are allowed as onset 
heads, we have to specify that only sonorant consonants that exhaustively 
contain C in their manner class are allowed, in other words, the most C-like 
sonorant consonants. 


Linear order in the lexicon is limited 


Anderson (1987) questions the necessity of specifying the linear order of 
segments in the lexicon, if segments are organised in a syllabic structure. The 
argument is that an onset that groups an obstruent and a sonorant consonant 
will necessarily order these two consonants in that order if we assume that 
linear order can be predicted from the Sonority Sequencing Generalisation. 
The same point applies to segments that form a rhyme. Golston and van 
der Hulst (1999) accept this point, stating that if lexical representations are 
syllabified, the linear order of segments is predictable. These authors cite 
various arguments for why syllable structure is present in lexical repre- 
sentations, both linguistic and psycholinguistic. In the RCVP model, I will 
therefore also assume that segments within a syllabic unit are not linearly 
ordered.’ The linear order of onset and rhyme can also be predicted given that 
the rhyme contains the sonority peak. Formally, the way in which the syllabic 
distribution of segments is stored amounts to specifying the syllabic affiliation 
for each segment in terms of a syllabic C/V specification and the unordered 
grouping of segments into onsets rhyme and syllable units: 


© C GV V V;C 
(œ r) @ m)) 


Given that major class distinctions are interpretations of syllabic positions 
and given that segments in the lexicon are syllabified — meaning they come 
with a piece of syllable structure — there is no need for an independent layer of 
major class specifications; the piece of syllabic structure that segments ‘wear 
on their sleeves’ is the major class specification. In the present framework, it 
does not make sense to think of the syllabic specification as being specified 
‘in the root node’ of segments (as proposed in McCarthy 1988), or elsewhere. 
Given that segment-internal manner specifications are the head of the 
segmental structure, it will be those specifications that are projected upwards 
and will be visible on the root node of the segmental structure, so that they can 
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interact directly with the syllabic affiliation specification. Thus, the syllabic 
specification is independent of the segmental structure and one can think of 
the metaphor ‘wearing on their sleeve’ as referring to an association relation. 


(7) C C;V V V;C (syllabic affiliation) 
b r I m (segmental structure) 


In RCVP, in conclusion, syllable structure is not a projection of segmental 
structure. Rather, it is specified on a separate ‘tier’ in association with the 
segmental structure. 


Empirical issues 


The proposal in the previous section makes very strong predictions about 
the maximal complexity of syllables and which segment types can occur in 
syllabic positions. Both onset and rhyme can be maximally binary branching. 
In addition, we disallow onsets consisting of two obstruents, two sonorant 
consonants, and sonorant consonant followed by an obstruent. We also 
disallow obstruents to function as nuclei and as codas. A similarly strict view 
was adopted by proponents of ‘classical’ GP. But this approach obviously 
faces a host of empirical problems. 

For example, this position requires a reconsideration of so-called 
syllabic obstruents in Tashlhiyt Berber (see Dell and Elmedlaoui 2002) and 
other languages (see Bell 1978). It also needs to deal with the fact that in 
languages such as English and Dutch obstruents would appear to occur in 
codas almost as freely as sonorant consonants. Finally, consideration should 
be given to numerous cases in which consonant clusters in apparent onsets 
are not a sequence of an obstruent followed by a sonorant consonant, but 
two obstruents (Morelli 1999; Kreitman 2008), or cases in which the number 
of onset consonants exceeds two, as, Polish (Cyran and Gussman 1999) and 
Georgian (Chitoran 1998; Toft 1999; Butskhrikidze 2002; Ritter 2006) 

Closer to home (for me at least), in English and Dutch, the left edge can 
have triconsonantal structures of a limited variety. They have to start with 
[s] followed by an obstruent + liquid cluster (see Trommelen 1983; van der 
Hulst 1984; Fudge 1987). On the right edge, we find so-called superheavy 
rhymes containing a tense vowel followed by a consonant, or a lax vowel 
followed by two consonants. Classical GP deals with such problematic cases 
by postulating empty nuclei, but van der Hulst (2020) argues against the use 
of empty structural positions (both segment-internally, such as ‘empty class 
nodes’, and in syllabic structure). In the following section, I will propose 


ELEMENTS, GOVERNMENT, AND LICENSING 


solutions for these types of empirical issues which will draw on allowing 
recursivity in onset structure. Unfortunately, I will not be able to offer detailed 
analyses for all of them. 


Recursive syllable structure 


Previous views 


In van der Hulst (2020) I discuss the possibility of recursion in syllable 
structure, here understood as the possibility of embedding syllables inside 
syllables; that is, self-embedding tail recursion. A commonly held view in 
theoretical linguistics is that the formal organisation of phonology is funda- 
mentally different from that of syntax, one manifestation of this idea being 
that recursion does not exist in phonology, except where phrasal phonology 
‘copies’ recursive syntactic structure to some extent. 


Recursion consists of embedding a constituent in a constituent of the 
same type, for example a relative clause inside a relative clause [...]. This 
does not exist in phonological structure: a syllable, for instance, cannot 
be embedded in another syllable (Pinker and Jackendoff 2005, 10). 


. syllabic structure is devoid of anything resembling recursion 
(Bickerton 2000). 


It should be noted that discussions about phonological structure and recursion 
are always carried out in a constituency-based approach. I have adopted a 
dependency approach which does not involve constituency. Nevertheless, it 
seems to me that the issue of recursion can also be addressed in a dependency 
approach. 

Itis well known that several linguists have pushed for structural analogies 
between syllable structure and sentence or phrasal structure (Kurylowicz 
1948; Pike and Pike 1947; Fudge 1987): 


(8) a. syllable b. sentence 
a A 
onset rhyme NP VP 
ANN aN 
nucleus coda V NP 


More recent claims to the same effect can be found in Levin (1985), Carstairs- 
McCarthy (1999), and Völtz (1999), among others. Whatever the merit of 
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these parallels, no mention is made in these works of a potential further 
parallelism that would involve recursion. Most writers, while acknowledging 
that phonotactic structure is constituency-based (and referring to an X-bar(ish) 
organisation of syllables), propose that phonological (often called ‘prosodic’) 
constituency is ‘strictly layered’, which means that no constituent contains a 
constituent of the same type. This explicitly bars (self-embedding) recursion. 
With reference to ‘higher’ phonological/prosodic structure, recursion has 
been recognised, but here it is then said to reflect the recursive structure of 
syntax, at least to some extent (Ladd 2008; Wagner 2005; van der Hulst 2010; 
Hunyadi 2010).? Limiting recursion in phonology to units that have morpho- 
syntactic structure is tantamount to saying that no recursion will be found 
within morphemes (or simplex words), where whatever structure exists cannot 
be a mapping from morphosyntactic structure. A recent collection of studies 
(Backley and Nasukawa 2020), however, tackles the question of morpheme- 
internal recursion head on. In this volume den Dikken and van der Hulst 
(2020) offer an extensive demonstration of parallels between morpheme- 
internal phonological and syntactic structure. 

However, some phonologists — whose proposals differ in several ways 
that will not concern us here — have argued that syllable structure can display 
recursion (Smith 1999, 2003; Garcia Bellido (2005); van de Weijer and 
Zhang 2008; van der Hulst 2010). The present chapter builds on van der Hulst 
(2010). 

The point of departure is the syllabic model that was introduced in (3). 
The crucial idea in van der Hulst (2010) is that the coda position can form an 
entire syllable, making the coda the recursive node in syllable structure, just 
like the complement position in an X-bar-type organisation. Adopting the C/V 
dependency notation, the proposal in van der Hulst (2010) is that the structure 
for the traditional notion is a trochaic foot in (9a) can be recast as the structure 
in (9b) with significant explanatory gain. This is illustrated with the Dutch 


word kdno ‘canoe’:!° 


(9) a. V (trochaic foot) b. V (recursive syllable) 
N 
V (syll) V (syll) C V (syll) 
C oOo o! _ 
k a n o k an o 
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The embedding of syllables inside syllables does not have to stop here. A 
full structure of a so-called dactylic ‘ternary foot’, sometimes referred to as a 
‘superfoot’ (as in English vanity), displays degree-2 embedding. 

(10) 


(recursive syllable, 2 degrees) 


y 
C 
t 


@stsnesee cesar sues aa 


y 
y 
æ 


5--------Q 


i 


This structure is a perfectly legitimate object also in, for example, English 
Winnipesaukee, hippopotamus. An interesting consequence of this proposal 
is that it is now immediately clear why in poetic rhyme the initial onset is 
ignored, but not the second (or indeed the third, in forms like sanitsy~vanity). 
The initial consonant of such structures is external to the whole sequence 
that forms the rhyming unit in the proposal made here. The recursive 
structures capture the special position of the initial onset, as opposed to 
the other more deeply embedded onsets (which must be identical). In the 
structures proposed here the rhyming subpart of the string forms a unit, which 
is not the case in a traditional foot structure in which all syllables are separate 
units, as in (9a). 


Recursive rhymes revisited 


This proposal in (9b) and (10) faces two problems that went unnoticed in van 
der Hulst (2010). Firstly, the recursive node (the coda) is a ‘bare’ C position in 
a ‘closed’ syllable that has no embedded syllable, while it seemingly must be a 
V position, when the coda is an embedded syllable. The second problem is that 
the matrix syllable can itself be a ‘closed syllable’, as in a word like [tempo] 
(in either English or Dutch), which would seem to imply ‘double occupancy’ 
of the coda position. 

To solve both problems, I propose in van der Hulst (2020, §3.2.5) 
that the embedded syllable is the ‘complement’ of the C node in the case of 
[tempo] (as in 1 1b), while it is subjoined to the C node in the absence of a coda 
consonant: 
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(11) a. V (‘foot’) b. V (‘foot’) 


v yv 
ee 1 
Cc on g Cc i 
os E 
' i Vv Pa V 
| T E a 
ko a € ' 
k a n o t e m p 0 


The embedded syllable is thus a C-type syllable, which accounts for its less 
prominent status in comparison to the head matrix syllable.!! 

Here I consider an alternative for (1la), while adding some further 
examples for (1 1b): 


(12) a. V (‘foot’) b. V (‘foot’) 
V V 
pe CP, 
C ' C l C 
' i V i V 
' ' ! y ' ' Pa 
C i i ! | C | 
k 2 no t £ m p ò 
l a —_<<<s o 
m E€ m >>>- o 


In (12a), the ‘>>>’ indicate ‘virtual gemination of the ‘long’ vowel. Under 
(12b) I also added the examples /asso and menno, which both have a virtual 
geminate for the so-called ‘ambisyllabic’ consonant (as proposed in van der 
Hulst 1985; 2003), albeit with different headedness. While the coda position 
in (3) calls for sonorant consonants, it was noted in the section ‘Empirical 
issues’ that in languages such as Dutch and English, obstruents can also make 
an appearance in a rhyme headed by a lax vowel. But if the syllabic position 
encodes the major class of segments, then the coda position simply cannot 
dominate an obstruent. I therefore locate the obstruent as an onset within the 
embedded syllable which produces a right-headed virtual geminate (even in 
word-final position): 
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(13) a. V (rhyme) b. V (thyme) 
en ae 


Č Č 
_ Ș 
! i if i 
C 
k a <<<} ‘cat’ k a m ‘comb’ 


When a lax vowel is followed by a sonorant consonant, this consonant can 
occupy the coda position. When there is a following vowel, this creates a 
left-headed virtual geminate, as shown in (12b). In van der Hulst (2008) it 
is shown that diminutive allomorphy in Dutch is different for syllables with 
a lax vowel ending in a sonorant consonant or in an obstruent: katje versus 
kammetje, with the syllable ending in a sonorant taking the so-called long form 
-etje, see van der Hulst (2008, 1,295) for details. 

The difference between the stressed vowel in kano and lasso is that 
the former is a tense [a], which can occur in an ‘open syllable’, whereas the 
latter is a lax [£], which requires a closing consonant which is the left half of a 
virtual geminate. With respect to stress, tense vowels behave as light, whereas 
lax vowels (occurring in a close syllable) are heavy; see van der Hulst (1984). 

We can extend the degree of embedding and accommodate more cases, 
all with initial stress. The following list of Dutch words is representative of the 
various syllable structures that need an account: 


(14) a. lasso ‘lasso’ 
b. menno (proper name) 
c. kano ‘canoe’ 
d. tempo ‘tempo’ 
e. la ‘drawer’ 
f. lam ‘lamb’ 
g. raam ‘window’ 
h. sambal “Indonesian spice’ 


dominee ‘minister’ 


jais 


almanak ‘almanac’ 
atlas ‘atlas’ 
spelonk ‘grotto’ 
grotesk ‘grotesque’ 


z E 
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(15) V (rhyme) 
Pk 


o 
' | V (embedded syllable) 
pe: 
i 4. | 
: ' 3 l Pay (embedded syllable) 
Irag l 
SERE 
a la >«s ò no i (virtual right-headed c:) 
b me n>>- o E ' | (virtual left-headed c:) 
ce k aè- n o ' © i i (virtual v:) 
d tem p o ioin | 
e Dare | ' ' © | | (virtual v:) 
ff lam : ' i he (final coda) 
go r aè- m ' : | | (virtual v:, stranded onset!?) 
h sam b a l : : |< (final coda) 
i d om- m io>-3n e | (virtual v:) 
I: a | m a>>— n a k (stranded onset) 
k. a 4*4 | a s (virtual v:, final coda) 
l. spl e>- 1l >. gk (virtual v:, stranded onset) 
m. gr oœ- t € sk (virtual v:, stranded onset) 


It would seem that the coda unit (the ‘complementiser’) can have a complement, 
but there appears to be no ‘need’ for a specifier. However, this may be a 
limitation for internal specifiers only, if we allow a specifier position at the 
highest level (not indicated in (15)) for ‘rising’ diphthongs like [ja, wo], etc. 
Perhaps this option can also be used for so-called short diphthongs which 
are also typically rising. If, then, only internal specifiers for codas are not 
needed, this is perhaps just an instance of dependent units displaying a simpler 
structure than head units; see Dresher and van der Hulst (1998). 


Recursive onsets 


In this section I will propose that the notion of recursivity can also do work for 
us in the onset unit of the syllable.!° For onsets I propose the following, which 
is largely parallel to the rhyme structure in (15). For starters, for the onset we 
could have a (word-initial) specifier position for, among others, [s]: 
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(16) Examples of word-initial onset clusters!” 


a. 


roa mo ae S 


pl 
sp 
spl 
sl 
kn 


pt 


zdzbw 
g’prek’n 


mts’ ’rtn 


crm 


(Dutch: plek ‘place’) 
(Dutch: spin “spider’) 
(Dutch: splinter ‘splinter’) 
(Dutch: slak ‘snail’) 
(Dutch: knie “‘knee’) 
(Hungarian: ptozis ‘ptosis’, dzéta “zeta’) 
(Polish: zdzbwo ‘blade [of grass]’) 
(Georgian: gwprckwni ‘you peel us’ 
mts ’wrtneli ‘trainer’) 
(sesquisyllables in Kammu: [cr.mool] ‘sowing season’) 


The structure in (17) can accommodate all cases in (16):!8 


(17) 


ro moe ao gp 


-0 HW WH NM------ 


N==s6 


C 
C 


(onset) 


e (By B tg to: eE 


V 


"a—Ù (embedded onset) 


:<—C (embedded onset) 


(normal complex onset) 
| (sp clusters) 

! ' (spl clusters) 

© (sl clusters) 

(obstruent - nasal clusters) 
t ' : (Hungarian clusters) 
(Polish clusters) 
(Georgian clusters!°) 


O 
Q 
< 
@ © ------- 
=) 


rm (sesquisyllables in 
Kammu) 


Again, we do need an internal specifier for C except at the highest level, so 


maybe this is a general constraint. But V also does not have/need an (internal) 


specifier, due to reduced complexity in dependent units, of which the onset is 


itself a case. 


The structure in (17) suggests an appeal to empty nuclei, which van der 
Hulst (2020) has tried to ban. However, empty nuclei can be circumvented by 
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subjoining V under C (in a sense making a ‘vertical’ CV unit) as indicated by 
the left-pointing arrow. 


lambic feet 


The proposal for recursion in syllable structure provides an alternative to the 
representation of so-called trochaic (SW) and dactylic (SWW) feet that has 
been proposed in Metrical Theory. This theory has also made use of iambic 
feet and various proposals for a complete foot typology have been proposed. 
Here I will not discuss how RCVP can be applied to ‘iambic’ foot types, apart 
from saying that as proposed in van der Hulst and Ritter (1998) so-called 
iambic feet can be analysed as super syllables, with a C adjunct to the left of 
the head syllable: 


(18) y (‘iambic foot’) 
C y 


v E y 


This then is more like the original iambic foot structure, although I would 
argue that the unit as a whole is just another kind of extension of core syllable 
structure, in this case with a left-adjunction structure, as suggested in den 
Dikken and van der Hulst (2020, §4.3). 


Why recursion is less pervasive in phonology 


The central thesis of van der Hulst (2010, 2020, and much previous work), 
following Dependency Phonology’s notion of Structural Analogy, is that 
phonology and syntax have recourse to the same computational system, that 
is, that both modules are maximally analogous. This thesis goes beyond the 
claim that both phonology and syntax build hierarchical structures. This claim 
is commonly made (though not supported by all linguists) with the proviso 
that the nature of the hierarchical organisation is fundamentally different 
with phonology adhering to ‘strict layering’, while syntax displays recursive 
structure. Accepting that recursion is available to phonology does not entail 
that phonology will display the same amount of recursive structure as morpho- 
syntax. The kinds of structures that are employed in both modules do not 
exist in a vacuum, but rather are formed to accommodate the substances that 
these structures are grounded in. There is a simple reason for why recursion 
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in phonology is less pervasive. If we accept the fact that semantic, conceptual 
structure (Anderson would say ‘conceptual substance’) is inherently recursive, 
we expect morphosyntax to be isomorphic to this semantic, conceptual 
structure as much as is possible. Phonological structure accommodates 
phonetic-perceptual substance, which arguably is not inherently recursive. 
Rather, as the result of motoric actions, it is essentially sequential and 
rhythmic. This may lead to the view that phonology is ‘flat’ (see Scheer 2013), 
perhaps only displaying recursion when expressions are morphosyntactically 
structured. But recursion in phonology is limited even in this case because 
there is a ‘flattening force’ that causes disrhythmic structures that contain 
lapses (sequences of weak units, ‘SWWW...’) to flatten by breaking up 
into smaller rhythmic units (that is, SW SW), as shown in Giegerich (1985). 
This in itself shows that phonological structure is not entirely flat. After 
all, if there is rhythmic structure this means that the units (syllables, words, 
etc.) display a structure in which certain units are ‘subordinated’ to others. 
Standard metrical phonology has chosen to formally represent this ‘subordina- 
tion’ by grouping units into binary, headed constituents. The crucial point of 
van der Hulst’s (2010) proposal was that subordination can also be encoded 
in terms of embedding, which then establishes a perfect formal parallel 
with recursion in syntax. The same flattening force that limits phonological 
recursion in morphosyntactically complex expressions, also prevents more 
than three levels of recursion in monomorphemic units. A sequence of four 
syllables is therefore not structured as a quaternary ‘foot’ which would have a 
dysrhythmic sequence ‘SWWW’ that does not match the rhythmic structure of 
a quadrisyllabic sequence. Indeed, a string of four CV units is likely to display 
an alternating rhythmic structure (SWSW), which suggests the presence in 
the structure of two consecutive units, each with level embedding. Beyond 
the ‘magic number’ 3, unbounded recursion gives in to rhythm in phonology. 


Conclusions 


In conclusion, the argument that recursion is unique to morphosyntax is not 
compelling. It is a manifestation of a syntactico-centric way of thinking that 
a computational device that the human mind has would only apply within 
one component of the grammar. It is simply not reasonable to claim that the 
mental power to combine units that themselves are the product of combining 
(thus allowing self-embedding as a logical option) is limited to grammar, 
let alone to one component in the grammar. In this chapter, I have shown 
that by allowing recursion in syllable structure, we can account for recurrent 
phenomena that involve the phonological dimension of language. Recursivity 
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in the rhyme provides a rationale for the type of poetic rhyme that refers to 
the stressed vowel and everything that follows in a ‘foot’, but traditional foot 
structure provides no formal basis for the rhyming sequences. What I have 
added in this chapter is the idea that recursivity can also be applied to onset 
structure, allowing precisely the maximal complexity that has been observed 
for languages with excessive onsets (such as Polish and Georgian), although 
a more detailed account of these and other cases is certainly called for. I 
conclude that recursion in phonology is real, and its results are revealing. 


Notes 


1. [here ignore the claim in GP that the syllable unit as such is not required. Rather, in GP there is a lateral 
licensing relation between the onset and the rhyme unit which requires them to always occur together. 
Perhaps there is a resemblance between seeing all syntagmatic relations in terms of dependency and 
seeing them in terms of lateral licensing, as discussed by Scheer (2013), among others. 

2. These extra segments of course need an account. GP opts for allowing extra syllables with empty 
nuclei. I propose an alternative account in the section ‘Recursive onsets’. 

3. Here I will not discuss the RCVP theory of segmental structure; see van der Hulst (2020) for a full 
account and van der Hulst (2021) for a summary. 

4. Fora discussion of the representation of long vowels, diphthongs, and geminates, see van der Hulst 
(2020, § 4.5). 

5. Iwill also invoke this distinction in the section ‘Recursive onsets’, where I discuss recursion in onset 
structure. 

6. In the syntactic plane, Anderson (2011) uses the subjunction structure to represent category 
conversion in the lexicon, for example, from noun to verb or vice versa. 

7. Iwill leave open here whether we can extend the elimination of linear order above the level of the 
syllable, although it is easy to see that syllables forming a foot can also be ordered, given that the foot 
type is fixed for the language as a whole. 

8. In English and Dutch, a further word-final syllabic unit (called the ‘appendix’) is possible. I will not 
discuss this unit, an additional rhymal adjunction, here (see Fudge 1987; van der Hulst 1984). 

9. The claim of strict layering which prevents recursion was ‘officially’ abandoned in later work in 
prosodic phonology, albeit that the driving force behind recursive structures in prosody is still 
morphosyntax (see Selkirk 2011). 

10. The representation of both syllables requires a V node in (9a), which seemingly goes against the 
expectation that V heads take C dependents, and vice versa. The same issue arises in (9b). I address 
this issue below. 

11. Several other ‘solutions’ to the problem of double occupancy are discussed and rejected in den 
Dikken and van der Hulst (2020), which explore a somewhat different development of van der 
Hulst (2010). This article proposes to ‘enlighten’ phonology with the introduction of a parallel to the 
syntactic notion of ‘light v’, an idea that is not adopted here. 

12. Iam here accepting the proposal in Polgardi (1998) that word-finally a ‘stranded’ onset is permitted. 

13. The virtually long versions of tense vowels are only required in stressed syllables, which must have a 
branching rhyme as proposed in Trommelen (1983) and Lahiri and Koreman (1988). This means that 
the second unstressed open syllable in (15i and 15j) does not need a virtual geminate vowel. Instead, 
the following (also unstressed) syllables can be subjoined under the preceding C position, which is 
indicated by the left-pointing arrow in the tree structure in (15). 

14. Note that I have placed an obstruent in coda which overrides the prohibition. This is due to the fact 
that lax vowels are subcategorised for occurring with coda consonant. 

15. Here the occurrence of [s] in the coda position is strictly speaking irregular given that this position is 
reserved for obstruents. The reason for allowing this obstruents is as in the previous footnote. 

16. Here I develop ideas that were presented in a talk at The form of structure, the structure of form — 
Three days of linguistics for Jean Lowenstamm, January 15-17, 2015, University Paris Diderot — 
Paris 7. 
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17. Dutch (16a-e): Trommelen (1983); van der Hulst (1984). Hungarian (16f): Térkenczy and Siptar 
(1999). Polish (16g): Cyran and Gussmann (1999). Georgian (16h): Toft (1999); Butskhrikidze 
(2002); Ritter (2006). Kammu (16i): Svantesson (1983); van der Hulst and Ritter (1998). 

18. I suspect that the word-initial vowel in Arrernte (Green 1999) may also find an account in this onset 
model, with the vowel occupying the specifier position and the following consonant(s) being the 
onset, which Green argues to be missing in this language. 

19. The unit ‘ck’ is a so-called harmonic cluster which I here analyse as a complex segment 
(Butskhrikidze 2002, 103). The written <w> is analysed as a secondary articulation (Butskhrikidze 
2002, 93). 
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